Enhanced forged face detection method based on multi-level attention and dynamic feature fusion

Abstract

Abstract: With the rapid evolution of generative models such as generative adversarial networks (GANs), the quality of forged face images continues to improve, posing serious challenges to social media, identity authentication, and public opinion security. Forged image detection has therefore become a research hotspot in information and network security. Existing methods mainly rely on spatial-domain texture analysis, frequency-domain artifact extraction, or temporal consistency modeling, and introduce various attention mechanisms and feature fusion strategies to improve performance. However, these methods typically generalize poorly and use rigid feature fusion mechanisms, which makes it difficult for them to adapt to continually evolving forgery techniques. To address these problems, this paper proposes a forged face detection model with a dual-branch structure. The model extracts multi-dimensional features in the spatial domain and the frequency domain separately, and introduces a trainable dynamic feature fusion module that adaptively weights and fuses features across the two domains, strengthening their complementarity. In addition, an image augmentation strategy based on random channel masking is designed to improve the robustness of the model across diverse forgery scenarios. Experimental results show that the proposed method achieves strong performance on multiple benchmark datasets and generalizes well in cross-dataset tests, providing an efficient and scalable solution for forged image detection.

Key words: deep learning, neural network, forged face detection, image enhancement, feature fusion
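
The abstract names three ingredients (random channel masking, a frequency-domain branch, and trainable adaptive fusion) without giving their details. The NumPy sketch below shows one simple reading of each and is not the authors' implementation. All function names, the single-channel masking rule, the log-magnitude FFT input, and the softmax gate are assumptions made for illustration.

```python
import numpy as np


def random_channel_mask(image, mask_prob=0.5, rng=None):
    """Zero out one randomly chosen channel with probability mask_prob.

    Assumption: the abstract only says "random channel masks"; masking a
    single whole channel of an (H, W, C) image is one possible reading.
    """
    rng = np.random.default_rng() if rng is None else rng
    out = image.copy()
    if rng.random() < mask_prob:
        channel = rng.integers(out.shape[-1])
        out[..., channel] = 0.0
    return out


def log_spectrum(image):
    """Frequency-domain view: centred log-magnitude of the per-channel 2-D FFT.

    Assumption: the abstract does not say how frequency features are obtained.
    """
    spectrum = np.fft.fftshift(np.fft.fft2(image, axes=(0, 1)), axes=(0, 1))
    return np.log1p(np.abs(spectrum))


def dynamic_fusion(spatial_feat, freq_feat, gate_weights, gate_bias):
    """Fuse two feature vectors with input-dependent softmax weights.

    gate_weights (shape (2, 2 * d)) and gate_bias (shape (2,)) are
    hypothetical stand-ins for parameters that would be learned end to end.
    """
    joint = np.concatenate([spatial_feat, freq_feat])
    logits = gate_weights @ joint + gate_bias
    logits = logits - logits.max()  # numerically stable softmax
    weights = np.exp(logits) / np.exp(logits).sum()
    fused = weights[0] * spatial_feat + weights[1] * freq_feat
    return fused, weights


# Toy usage: random data stands in for a face crop and for learned parameters.
rng = np.random.default_rng(0)
image = rng.random((8, 8, 3))
masked = random_channel_mask(image, mask_prob=1.0, rng=rng)
spatial_feat = masked.mean(axis=(0, 1))               # toy 3-d spatial feature
freq_feat = log_spectrum(masked).mean(axis=(0, 1))    # toy 3-d frequency feature
gate_w = rng.normal(size=(2, 6))
gate_b = np.zeros(2)
fused, weights = dynamic_fusion(spatial_feat, freq_feat, gate_w, gate_b)
```

In this sketch the fusion weights depend on the input features, so the balance between the spatial and frequency branches can differ from image to image, in contrast to a fixed concatenation or a fixed weighted sum.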

ZHAO Ya (赵娅), GAO Mingchao (郜明超), YAO Wenda (姚文达), XU Feng (徐锋). Enhanced forged face detection method based on multi-level attention and dynamic feature fusion[J]. Computer Engineering & Science.

