• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学

• • 上一篇    下一篇

基于多层次注意力与动态特征融合的增强伪造人脸检测方法

赵娅, 郜明超, 姚文达,徐锋   

  1. (1.东北石油大学 计算机与信息技术学院, 黑龙江 大庆 163318;
    2. 黑龙江省石油大数据与智能分析重点实验室, 黑龙江 大庆 163318;
    3. 山东省滕州市公安局 网络警察大队, 山东 滕州 277500)

Enhanced forged face detection method based on multi-level attention and dynamic feature fusion

ZHAO Ya, GAO Mingchao, YAO Wenda, XU Feng   

  1. (1. School of Computer and Information Technology, Northeast Petroleum University, Daqing Heilongjiang 163318, China;
    2. Heilongjiang Key Laboratory of Big Data and Intelligent Analysis of Petroleum, Daqing Heilongjiang 163318, China;
    3. Cyber police brigade, Tengzhou Public Security Bureau, Tengzhou Shandong 277500, China)

摘要: 随着生成对抗网络(GAN)等生成模型的快速演进,伪造人脸图像的质量持续提升,给社交媒体、身份认证与舆情安全带来严峻挑战,伪造图像检测已成为当前信息网络安全领域的研究热点。现有方法主要集中在空间域纹理分析、频率域伪痕提取或时序一致性建模等方向。然而,这些方法通常存在泛化能力弱,难以适应不断演化的伪造技术。本文针对上述问题,提出一种基于双分支结构的伪造人脸检测模型,分别在空间域和频率域提取多维特征,并引入可训练的动态特征融合模块,实现特征域间的自适应加权融合,增强特征互补性。同时,设计一种基于随机通道掩膜的图像增强策略,有效提升模型在多种伪造场景下的鲁棒性。实验结果表明,本文方法在多个基准数据集上均取得了优异的性能,并在跨数据集测试中展现出较强的泛化能力,为伪造图像检测提供了高效且具扩展性的解决方案。

关键词: 深度学习, 神经网络, 伪造人脸检测, 图像增强, 特征融合

Abstract: With the rapid evolution of generative models such as Generative Adversarial Networks (GAN), the quality of forged facial images has continued to improve, posing severe challenges to social media, identity authentication and public opinion security. Forged image detection has become a research hotspot in the current field of information network security. The existing methods mainly focus on directions such as spatial domain texture analysis, frequency domain artifact extraction or temporal consistency modeling, and introduce multiple attention mechanisms and feature fusion strategies to improve performance. However, these methods usually have problems such as weak generalization ability and rigid feature fusion mechanisms, making it difficult to adapt to the constantly evolving forgery techniques. This paper proposes a forged face detection model based on a dual-branch structure in response to the above problems. It extracts multi-dimensional features in the spatial domain and the frequency domain respectively, and introduces a trainable dynamic feature fusion module to achieve adaptive weighted fusion between feature domains and enhance feature complementarity. Meanwhile, an image enhancement strategy based on random channel masks is designed to effectively improve the robustness of the model in various forgery scenarios. The experimental results show that the method proposed in this paper has achieved excellent performance on multiple benchmark datasets and demonstrated strong generalization ability in cross-dataset tests, providing an efficient and scalable solution for forged image detection.

Key words: deep learning, neural network, forged face detection, image enhancement, feature fusion