• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (11): 2045-2052.

• Graphics and Images • Previous Articles     Next Articles

A smoke recognition method based on CNN and Transformer feature fusion

FU Yan,YANG Xu,YE Ou   

  1. (College of Computer Science & Technology,Xi’an University of Science and Technology,Xi’an 710600,China)
  • Received:2023-08-15 Revised:2023-12-19 Accepted:2024-11-25 Online:2024-11-25 Published:2024-11-27

Abstract: Currently, many smoke recognition algorithms suffer from high false alarm rates, partly due to the fact that most existing convolutional neural networks (CNNs) mainly focus on local information in smoke images during feature extraction, neglecting the global features of smoke images. This bias towards local information processing can easily lead to misjudgments when dealing with variable and complex smoke images. To address this issue, it is necessary to capture the global features of smoke images more accurately, thereby improving the accuracy of smoke recognition algorithms. Therefore, this paper propose a dual-branch smoke recognition method, TCF-Net, which combines the Inception and Transformer structures. This model is improved to enrich feature diversity while reducing channel redundancy. Additionally, the self-attention mechanism from Transformer is introduced, combining its ability to learn global context information with CNNs capacity to learn local relative position information. During feature extraction, a feature coupling unit (FCU) is embedded to continuously interact the local features and global information in both branches, maximizing the retention of both local and global information and enhancing the performance of the algorithm. The proposed algorithm can classify video frames into three states: black smoke, white smoke, and no smoke. Experimental results show that the improved network can better extract smoke features, reducing the false alarm rate while increasing the accuracy to 97.8%, confirming the excellent performance of the algorithm.

Key words: smoke recognition, convolutional neural network, self-attention mechanism, feature fusion