• Journal of the China Computer Federation (CCF)
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (11): 2045-2052.

• Graphics and Images •

  • Supported by: China Postdoctoral Science Foundation (2020M673446)

A smoke recognition method based on CNN and Transformer feature fusion

FU Yan,YANG Xu,YE Ou   

  1. (College of Computer Science & Technology,Xi’an University of Science and Technology,Xi’an 710600,China)
  • Received:2023-08-15 Revised:2023-12-19 Accepted:2024-11-25 Online:2024-11-25 Published:2024-11-27



Abstract: Many current smoke recognition methods suffer from high false alarm rates, partly because most existing convolutional neural networks (CNNs) focus mainly on local information in smoke images during feature extraction and neglect their global features. This bias towards local information easily leads to misjudgments on variable and complex smoke images. Addressing this issue requires capturing the global features of smoke images more accurately, thereby improving the accuracy of smoke recognition methods. Therefore, this paper proposes TCF-Net, a dual-branch smoke recognition method that combines the Inception and Transformer structures. The Inception module is improved to enrich feature diversity while reducing channel redundancy. In addition, the self-attention mechanism of the Transformer is introduced, combining its ability to learn global context information with the CNN's capacity to learn local relative position information. During feature extraction, a feature coupling unit (FCU) is embedded to continuously exchange local features and global information between the two branches, maximizing the retention of both local and global information and enhancing the performance of the method. The proposed method classifies video frames into three states: black smoke, white smoke, and no smoke. Experimental results show that the improved network extracts smoke features more effectively, reducing the false alarm rate while raising accuracy to 97.8%, confirming the strong performance of the method.
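The dual-branch coupling described above can be sketched in a few lines. The snippet below is a minimal illustration of the general idea only; the shapes, random projections, and single-head attention are assumptions for demonstration and do not reproduce the paper's actual Inception branch or FCU design.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative shapes (assumptions): CNN branch produces a feature map
# of shape (C, H, W); Transformer branch holds H*W tokens of width D.
C, H, W, D = 8, 4, 4, 16
fmap = rng.standard_normal((C, H, W))       # local features (CNN branch)
tokens = rng.standard_normal((H * W, D))    # global tokens (Transformer branch)

# 1) Couple local -> global: flatten the feature map into per-location
#    vectors, project them into token space, and add them to the tokens.
W_c2t = rng.standard_normal((C, D)) / np.sqrt(C)    # hypothetical projection
local_as_tokens = fmap.reshape(C, H * W).T @ W_c2t  # (H*W, D)
tokens = tokens + local_as_tokens

# 2) Single-head self-attention over the fused tokens (no masking),
#    so every token can aggregate global context.
scores = tokens @ tokens.T / np.sqrt(D)
attn = np.exp(scores - scores.max(axis=1, keepdims=True))
attn /= attn.sum(axis=1, keepdims=True)             # rows sum to 1
global_ctx = attn @ tokens                          # (H*W, D)

# 3) Couple global -> local: project the global context back to the
#    CNN branch's channel space and add it to the feature map.
W_t2c = rng.standard_normal((D, C)) / np.sqrt(D)    # hypothetical projection
fmap = fmap + (global_ctx @ W_t2c).T.reshape(C, H, W)

# Both branches keep their original shapes after one coupling step.
# fmap: (8, 4, 4), tokens: (16, 16)
```

In the actual TCF-Net, this exchange happens repeatedly during feature extraction rather than once, which is what lets the two branches preserve both local detail and global context.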

Key words: smoke recognition, convolutional neural network, self-attention mechanism, feature fusion