• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (11): 1964-1973.

• Computer Network and Znformation Security • Previous Articles     Next Articles

Multi-stage detection and multimodal localization for audio deletion tampering

ZHANG Guofu,WANG Ru,SU Zhaopin,YUE Feng,LIAN Chensi,YANG Bo   

  1. (1.School of Computer Science and Information Engineering(School of Artificial Intelligence),
    Hefei University of Technology,Hefei 230601;
    2.Anhui Province Key Laboratory of Industry Safety & Emergency Technology
     (Hefei University of Technology),Hefei 230601;
    3.Joint Laboratory of Intelligent Prevention and Recognition of Audio and Video,Hefei 230009;
    4.Intelligent Interconnection System Anhui Provincial Laboratory (Hefei University of Technology),Hefei 230009;
    5.Anhui Provincial Public Security Department,Physical Evidence Identification and Management Division,Hefei 230000,China)
  • Received:2024-11-14 Online:2025-11-25 Published:2025-12-05

Abstract:

Audio deletion tampering detection faces severe challenges in the field of digital audio authentication, particularly under anti-forensic attacks. To address the difficulties in detecting and locating deletion tampering, a multi-stage detection and multimodal localization method for audio deletion tampering is proposed. Firstly, a header information analysis method is designed to screen out audio files suspected of undergoing header/footer deletion tampering. Subsequently, a column-average-based constant Q spectral sketch feature is introduced, along with a middle deletion tampering classification network that leverages a deep residual shrinkage network and an attention mechanism. Next, by integrating the results from header information analysis and the classification network, a comprehensive judgment is made on whether the audio deletion tampering has occurred. Finally, for detected middle deletion tampering, a localization method combining wavelet packet analysis with multimodal features is proposed. Comparative experimental results demonstrate that the proposed method can effectively detect header/footer deletion tampering and accurately locate middle deletion tampering. Specifically, the accuracy, precision, recall, and F1 score for middle deletion classification all exceed 98%, and the method exhibits enhanced robustness and localization accuracy when faced with conventional signal processing attacks.



Key words: audio blind forensics, delete tampering, detection and localization, deep residual shrinkage network, wavelet packet reconstruction