• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (1): 180-190.

• Artificial Intelligence and Data Mining • Previous Articles    

Document-level neural machine translation based on rhetorical structure

JIANG Yunzhuo,GONG Zhengxian   

  1. (School of Computer Science & Technology,Soochow University,Suzhou 215008,China)
  • Received:2023-10-30 Revised:2024-03-18 Online:2025-01-25 Published:2025-01-18

Abstract: Despite years of development and significant progress in document-level neural machine translation, most efforts have focused on building effective network structures from a model perspective by utilizing contextual word information, neglecting the guidance of cross-sentence discourse structure and rhetorical information for the model. Addressing this issue, under the guidance of Rhetorical Structure Theory, a method for separately encoding discourse units and rhetorical structure tree features is proposed. Experimental results show that the proposed  method enhances the encoders ability to represent discourse structure and rhetorical aspects. The improved model surpasses several high-quality baseline models, achieving notable improvements in translation performance across multiple datasets. Additionally, significant improvements in translation quality are demonstrated through the proposed quantitative evaluation method and human analysis.

Key words: neural machine translation, discourse analysis, document-level translation, rhetorical structure theory