[1] |
Li Ya-chao,Xiong De-yi,Zhang Min. A survey of neural machine translation[J].Chinese Journal of Computers,2018,41(12):100-121.(in Chinese)
|
[2] |
Kalchbrenner N, Blunsom P.Recurrent continuous translation models[C]∥Proc of the 2013 Conference on Empirical Methods in Natural Language Processing,2013:1700-1709.
|
[3] |
Sutskever I, Vinyals O,Le Q V. Sequence to sequence learning with neural networks[J].arXiv:1409.3215,2014.
|
[4] |
Bahdanau D,Cho K,Bengio Y.Neural machine translation by jointly learning to align and translate[J].arXiv:1409.0473,2014.
|
[5] |
Vaswani A,Shazeer N,Parmar N,et al.Attention is all you need[C]∥Proc of the 31st International Conference on Neural Information Processing Systems, 2017:6000-6010.
|
[6] |
Cho K, van Merrinboer B, Gulcehre C,et al.Learning phrase representations using RNN encoder-decoder for statistical machine translation[J].arXiv:1406.1078,2014.
|
[7] |
Razavian A S, Azizpour H,Sullivan J,et al.CNN features off-the-shelf:An astounding baseline for recognition[C]∥Proc of the 2014 IEEE Conference on Computer Vision and Pattern Recognition,2014:806-813.
|
[8] |
Weston J,Chopra S,Bordes A.Memory networks[J].arXiv:1410.3916,2014.
|
[9] |
Artetxe M,Labaka G,Agirre E,et al.Unsupervised neural machine translation[J].arXiv:1710.11041,2017.
|
[10] |
Lample G,Conneau A,Denoyer L,et al.Unsupervised machine translation using Monolingual corpora only[C]∥Proc of International Conference on Learning Representations,2018:10-14.
|
[11] |
Lample G,Ott M,Conneau A,et al.Phrase-based & neural unsupervised machine translation[J].arXiv:1804.07755,2018.
|
[12] |
Zhang Z,Liu S,Li M,et al.Joint training for neural machine translation models with Monolingual data[C]∥Proc of National Conference on Artificial Intelligence,2018:555-562.
|
[13] |
Barone A V.Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders[C]∥Proc of the 1st Workshop on Representation Learning for NLP,2016:121-126.
|
[14] |
Yang Z,Chen W,Wang F,et al.Improving neural machine translation with conditional sequence generative adversarial nets[J].arXiv:1703.04887,2017.
|
[15] |
Devlin J,Chang M,Lee K,et al.BERT:Pre-training of deep bidirectional transformers for language understanding[J].arXiv:1810.04805,2018.
|
[16] |
Lample G,Conneau A.Cross-lingual language model pretraining.[J].arXiv:1901.07291,2019.
|
[17] |
He K,Zhang X,Ren S,et al.Deep residual learning for image recognition[C]∥Proc of the 2016 IEEE Conference on Computer Vision and Pattern Recognition,2016:770-778.
|
[18] |
Taylor W L.“Cloze procedure”:A new tool for measuring readability[J].Journalism & Mass Communication Quarterly,1953,30(30):415-433.
|
[19] |
Sennrich R, Haddow B, Birch A, et al.Neural machine translation of rare words with subword units[J].arXiv:1508.07909,2015.
|
[20] |
Chen Y,Liu Y,Cheng Y,et al.A teacher-student framework for zero-resource neural machine translation[C]∥Proc of the 55th Annual Meeting of the Association for Computational Linguistics,2017:1925-1935.
|
[21] |
Cheng Y,Tu Z,Meng F,et al.Towards robust neural machine translation[C]∥Proc of the 56th Annual Meeting of the Association for Computational Linguistics,2018:1756-1766.
|
[22] |
Papineni K,Roukos S,Ward T,et al.BLEU:A method for automatic evaluation of machine translation[C]∥Proc of the 40th Annual Meeting of the Association for Computational Linguistics,2002:311-318.
|
[23] |
Kingma D P, Ba J L.Adam:A method for stochastic optimization[C]∥Proc of the 3rd International Conference on Learning Representations,2015:1-15.
|
[24] |
Hendrycks D, Gimpel K.Bridging nonlinearities and stochastic regularizers with Gaussian error linear units[J].arXiv:1606.08415,2017.
|
|
附中文参考文献:
|
[1] |
李亚超,熊德意,张民.神经机器翻译综述[J].计算机学报,2018,41(12):100-121.
|