• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (12): 2246-2254.

• Artificial Intelligence and Data Mining • Previous Articles     Next Articles

A speech emotion recognition method using mixed distributed attention mechanism and hybrid neural network

CHEN Qiao-hong,YU Ze-yuan,JIA Yu-bo   

  1. (School of Information Science and Technology,Zhejiang Sci-Tech University,Hangzhou 310018,China)
  • Received:2021-03-19 Revised:2021-06-25 Accepted:2022-12-25 Online:2022-12-25 Published:2023-01-05

Abstract: Aiming at the problem that there are many irrelevant features and low accuracy in the existing speech emotion recognition, a speech emotion recognition method based on mixed distributed attention mechanism and hybrid neural network is proposed. The method  is in two channels, and the convolutional neural network and bidirectional short and long-time memory network are used to extract the spatial and temporal features of speech respectively, Then, the outputs of the two networks are used as the input matrix of the multi-head attention mechanism. At the same time, considering the low-rank distribution problem of the existing multi-head attention mechanism, the attention mechanism calculation method is improved. The low rank distribution and the similarity of the output characteristics of the two neural networks are superimposed by mixed distribution. After the normalization operation, all the subspace results are stitched together. Finally, the output is classified through the full connection layer. The experimental results show that, the speech emotion recognition method based on mixed distributed attention mechanism and hybrid neural network has higher accuracy than other existing models, verify- ing the validity of the proposed method.

Key words: speech emotion recognition, Mel frequency cepstral coefficient, bidirectional long short-term memory network, convolutional neural network, multi-head attention mechanism