A real-time facial manipulation video detection model based on ensemble learning dual-stream neural network

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (03): 470-477.

• Computer Network and Znformation Security • Previous Articles Next Articles

A real-time facial manipulation video detection model based on ensemble learning dual-stream neural network

YUAN Ye1,2,3，HUANG Li-qing1,2,3，YE Feng1,2,3，HUANG Tian-qiang1,2,3，LUO Hai-feng1,2,3，XU Chao1,2,3

(1.College of Computer and Cyber Security，Fujian Normal University，Fuzhou 350117;
2.Digital Fujian Institute of Big Data Security Technology，Fuzhou 350117;
3.Fujian Provincial Engineering Research Center of Big Data Analysis and Application，Fuzhou 350117，China)

Received:2022-10-27 Revised:2022-12-25 Accepted:2023-03-25 Online:2023-03-25 Published:2023-03-22

Abstract

Abstract: Malicious face manipulation has a negative impact on social security and stability, and it is a very important issue to accurately detect video images after face tampering. In order to solve the problem of poor real-time performance of video manipulation detection model, this paper proposes a face manipulation video detection model based on ensemble learning dual-stream recurrent neural network, and introduces the voting mechanism in ensemble learning. The model first receives a small number of consecutive sequence frames, extracts spatial features through a convolutional neural network, and introduces central differential convolution to enhance tampering artifacts in the spatial domain. The model then differentiates consecutive sequence frames to enhance tampering artifacts in the temporal domain, while temporal feature extraction is performed through a convolutional neural network. Then, the model splices the dual-stream feature vectors in the spatial domain and the time domain, and performs feature extraction through a recurrent neural network. During the feature extraction process of the recurrent neural network , the frame-by-frame feature information is retained as the input of the subsequent auxiliary frame-level classifier, while the final output of the recurrent neural network is used as the input of the video-level discriminator. Finally, the model introduces the voting mechanism of the integrated model to integrate the outputs of multiple auxiliary frame-level discriminators and video-level discriminators, and introduces a weight hyperparameter γ to balance the importance of the auxiliary frame-level discriminator and video-level discriminator, helping the model to improve detection accuracy. On the FaceForensics++ dataset, the experimental results show that the proposed model improves the average accuracy by 0.4% and 1.0% compared with mainstream detection model. At the same time, the proposed model can only use fewer consecutive frames for manipulation detection, which improves the real-time performance of the model.

Key words: Deepfake, convolutional neural network, recurrent neural network, voting mechanism, central difference convolution

YUAN Ye, HUANG Li-qing, YE Feng, HUANG Tian-qiang, LUO Hai-feng, XU Chao, . A real-time facial manipulation video detection model based on ensemble learning dual-stream neural network[J]. Computer Engineering & Science, 2023, 45(03): 470-477.

[1]	XU Xin, LI Ruo-shi, YUAN Ye, LIU Na. Semantic segmentation of foggy driving scenes based on learnable image filter [J]. Computer Engineering & Science, 2024, 46(11): 2027-2034.
[2]	FU Yan, YANG Xu, YE Ou. A smoke recognition method based on CNN and Transformer feature fusion [J]. Computer Engineering & Science, 2024, 46(11): 2045-2052.
[3]	PAN Yu-qing, YU Hao, LI Feng. An abnormal sound detection method based on weighted non-negative matrix decomposition [J]. Computer Engineering & Science, 2024, 46(08): 1425-1432.
[4]	TIAN Hong-peng, WU Jing-wei. RIB-NER:A span-based Chinese named entity recognition model [J]. Computer Engineering & Science, 2024, 46(07): 1311-1320.
[5]	YIN Chun-yong, ZHAO Feng. An anomaly detection model of time series based on dual attention and deep autoencoder [J]. Computer Engineering & Science, 2024, 46(05): 826-835.
[6]	MA Chang-lin, SUN Zhuang. Distantly supervised relation extraction based on entity knowledge [J]. Computer Engineering & Science, 2024, 46(05): 945-950.
[7]	CHEN Jie, LI Cheng, LIU Zhong. Convolutional neural network inference and training vectorization method for multicore vector accelerators [J]. Computer Engineering & Science, 2024, 46(04): 580-589.
[8]	CAO Hao-dong, WANG Hai-tao, HE Jian-fen. Date-aware sequential recommendation fusing local information of sequences [J]. Computer Engineering & Science, 2024, 46(04): 734-742.
[9]	QIN Wen-qiang, WU Zhong-cheng, ZHANG Jun, LI Fang, . Design of convolutional neural network acceleration system based on heterogeneous platform [J]. Computer Engineering & Science, 2024, 46(01): 12-20.
[10]	ZHOU Li, ZHAO Zhi-qiao, PAN Guo-teng, TIE Jun-bo, ZHAO Wang. RISC-V based design of graph convolutional neural network accelerator [J]. Computer Engineering & Science, 2023, 45(12): 2113-2120.
[11]	ZHOU Ju-xiang, ZHOU Ming-tao, GAN Jian-hou, XU Jian. A question generation model with multi-stage temporal and semantic information enhancement [J]. Computer Engineering & Science, 2023, 45(10): 1847-1857.
[12]	YU Zi-cheng, LING Jie. A DGA domain name detection method based on Transformer and multi-feature fusion [J]. Computer Engineering & Science, 2023, 45(08): 1416-1423.
[13]	LIU Jun-qi, TU Wen-xuan, ZHU En. Survey on graph convolutional neural network [J]. Computer Engineering & Science, 2023, 45(08): 1472-1481.
[14]	YI Xiao, MA Sheng, XIAO Nong. Running optimization of deep learning accelerators under different pruning strategies [J]. Computer Engineering & Science, 2023, 45(07): 1141-1148.
[15]	LIU Yang, SU Hang, HE Qian, SHEN Pu, LIU Peng. An equipment fault detection method based on cloud-edge collaboration variational autoencoder neural network [J]. Computer Engineering & Science, 2023, 45(07): 1188-1196.

A real-time facial manipulation video detection model based on ensemble learning dual-stream neural network

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 15

Recommended Articles 0

Metrics

Comments