• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (2): 330-340.

• Artificial Intelligence and Data Mining • Previous Articles     Next Articles

Data augmentation-based emotion recognition in conversation

TIAN Yu,LI Junhui,ZHU Suyang,ZHOU Guodong   

  1. (1.School of Computer Science & Technology,Soochow University,Suzhou 215006;
    2.Computing Science and  Artificial Intelligence College,Suzhou City University,Suzhou 215104,China)
  • Received:2024-11-21 Revised:2025-04-13 Online:2026-02-25 Published:2026-03-10

Abstract: Emotion recognition in conversation aims to classify the emotion of each utterance within a conversation. However, the label distribution in most datasets often exhibits significant imbalance. To address this issue, a data augmentation approach is proposed to enhance the model performance under conditions of label imbalance. Specifically, large language models are utilized to generate additional samples through back-translation, paraphrasing, and dialogue generation. The samples are filtered based on the harmonic mean of cosine similarity and self-Levenshtein distance. Experimental results on many  datasets show that this method improves model performance in imbalanced datasets, achieving gains in weighted-F1 scores and the recognition of minority labels compared to other state-of-the-art models.


Key words: data augmentation, emotion recognition in conversation, large language models