• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (08): 1461-1469.

Previous Articles     Next Articles

Source cell-phone identification under practical noises based on temporal convolutional network

WU Zhang-qian1,SU Zhao-pin1,2,3,4,WU Qin-fang1,ZHANG Guo-fu1,2,3,4   

  1. (1.School of Computer Science and Information Engineering,Hefei University of Technology,Hefei  230601;

    2.Intelligent Interconnected Systems Laboratory of Anhui Province (Hefei University of Technology),Hefei 230009;

    3.Anhui Province Key Laboratory of Industry Safety and 
    Emergency Technology (Hefei University of Technology),Hefei 230601;

    4.Engineering Research Center of Safety Critical Industrial Measurement and Control Technology,
    Ministry of Education,Hefei 230601,China)

  • Received:2020-05-19 Revised:2020-08-24 Accepted:2021-08-25 Online:2021-08-25 Published:2021-08-24

Abstract: To solve the problem of source cell-phone identification under practical environmental noises, a source cell-phone identification method based on linear discriminant analysis and temporal convolutional network is proposed. Firstly, the classification performance of different speech features under practical noises is analyzed in detail, based on which a new mixed speech feature is proposed according to band energy descriptor, constant Q transform, and linear discriminant analysis. Additionally, the mixed speech feature is used as the input to the temporal convolutional network for training and classification. Finally, the test results on the practical noise speech database of 10 brands, 47 mobile phone models, and 32,900 speech samples show that the average recognition accuracy of the proposed method reaches 99.82%. Moreover, compared with the two existing classical methods based on the band energy descriptor and support vector machine, and the constant Q transform domain and convolutional neural network, the proposed method increases the average recognition accuracy by about 0.44 and 0.54 percentages respectively, the average recall by about 0.45 and 0.55 percentages respectively, the average precision by about 0.41 and 0.57 percentages respectively, and the average F1-score by about 0.49 and 0.55 percentages respectively. The experimental results show that the proposed method has better comprehensive identification performance.

Key words: source cell-phone identification, practical environmental noise, mixed feature, linear discriminant analysis, temporal convolutional network