Facial expression recognition based on network fusion to improve MobileViT

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (06): 1072-1080.

• Graphics and Images • Previous Articles Next Articles

Facial expression recognition based on network fusion to improve MobileViT

DENG Xiang-yu,PEI Hao-yuan,SHENG Ying

(College of Physics and Electronic Engineering，Northwest Normal University，Lanzhou 730070，China)

Received:2023-04-26 Revised:2023-10-13 Accepted:2024-06-25 Online:2024-06-25 Published:2024-06-18

Abstract

Abstract: From the perspective of lightweight models, a facial expression recognition network based on network fusion to improve MobileViT is proposed. This network integrates multi-scale convolution PSConv and attention mechanisms through residual structures to form the RAPsconv feature reconstruction module. This module can more efficiently extract multi-scale features from a fine-grained perspective, enhancing the expression of key features, thereby improving the network's expressive ability and constructing an end-to-end facial expression recognition network. Additionally, to further narrow the gap between similar expressions, a loss function combining Softmax Loss and Center Loss is proposed, effectively reducing the misjudgment rate of expression recognition. Experimental results demonstrate that the improved network achieves higher accuracy on three natural scene expression datasets FER2013, FER+, and RAF-DB compared to the base network MobileViT, with accuracy improvements of 1.73%, 2.18%, and 1.64%, respectively. The improved network has fewer parameters, stronger robustness, and is suitable for lightweighting and integration, making it suitable for real-world applications in facial expression recognition.

Key words: facial expression recognition, MobileViT, multi-scale convolutional PSConv, attention mechanism, network fusion, lightweight network

DENG Xiang-yu, PEI Hao-yuan, SHENG Ying. Facial expression recognition based on network fusion to improve MobileViT[J]. Computer Engineering & Science, 2024, 46(06): 1072-1080.

[1]	XU Chao, RUAN Rongyao, CHEN Yong, . A blockchain-based medical data auditing method [J]. Computer Engineering & Science, 2025, 47(01): 95-106.
[2]	CHEN Zhaobo, ZHANG Lin, MA Xiaoxuan. Video anomaly detection with improved attention hybrid auto-encoder [J]. Computer Engineering & Science, 2025, 47(01): 130-139.
[3]	FU Yan, YANG Xu, YE Ou. A smoke recognition method based on CNN and Transformer feature fusion [J]. Computer Engineering & Science, 2024, 46(11): 2045-2052.
[4]	LIU Guo-qi, HE Ting-nian, RONG Yi-xuan, LI Zhuo-ran . A point of interest recommendation model based on tracks and friend relationship of users [J]. Computer Engineering & Science, 2024, 46(09): 1693-1701.
[5]	LIU Xiao-hua, XU Ru-zhi, YANG Cheng-yue. A Chinese named entity recognition model based on multi-feature fusion embedding#br# [J]. Computer Engineering & Science, 2024, 46(08): 1473-1481.
[6]	ZHANG Yong-zhi, HE Ke-ren, GE Jue. Low-altitude remote sensing image object detection based on improved YOLOv7 network [J]. Computer Engineering & Science, 2024, 46(07): 1269-1277.
[7]	WANG Ze-yu, XU Hui-ying, ZHU Xin-zhong, LI Chen, LIU Zi-yang, WANG Zi-yi. An improved dense pedestrian detection algorithm based on YOLOv8: MER-YOLO [J]. Computer Engineering & Science, 2024, 46(06): 1050-1062.
[8]	ZHANG Yu-ying, ZHU Guang-li, TAN Guang-pu, . A financial implicit sentiment analysis model based on sentiment enhancement and semantic dependency [J]. Computer Engineering & Science, 2024, 46(06): 1112-1120.
[9]	YIN Chun-yong, ZHAO Feng. An anomaly detection model of time series based on dual attention and deep autoencoder [J]. Computer Engineering & Science, 2024, 46(05): 826-835.
[10]	FAN Qi, WANG Shan-min, LIU Cheng-guang, LIU Qing-shan. Multi-target domain facial expression recognition based on class-wise feature constraint [J]. Computer Engineering & Science, 2024, 46(05): 836-845.
[11]	ZHAO Jin-yuan, JIA Di. A multi-person pose estimation correction algorithm based on improved YOLOv5 [J]. Computer Engineering & Science, 2024, 46(05): 852-860.
[12]	MA Chang-lin, SUN Zhuang. Distantly supervised relation extraction based on entity knowledge [J]. Computer Engineering & Science, 2024, 46(05): 945-950.
[13]	CAO Hao-dong, WANG Hai-tao, HE Jian-fen. Date-aware sequential recommendation fusing local information of sequences [J]. Computer Engineering & Science, 2024, 46(04): 734-742.
[14]	YAO Yuan-yuan, LIU Yu-hang, CHENG Yu-jing, PENG Meng-xiao, ZHENG Wen, . Self-supervised few-shot medical image segmentation with multi-attention mechanism [J]. Computer Engineering & Science, 2024, 46(03): 479-487.
[15]	JIN Guang-yin, ZHAO Xu-jun, GONG Yi-xuan. Moving trajectory destination prediction based on long short-term memory network [J]. Computer Engineering & Science, 2024, 46(03): 525-534.

Facial expression recognition based on network fusion to improve MobileViT

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 15

Recommended Articles 0

Metrics

Comments