An unsupervised video summarization algorithm based on deep and shallow feature fusion

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (09): 1602-1610.

• Graphics and Images •

ZENG Fan-feng, WANG Chun-zhen, LI Chen

(School of Information, North China University of Technology, Beijing 100144, China)

Received: 2022-06-09 Revised: 2022-10-18 Accepted: 2023-09-25 Online: 2023-09-25 Published: 2023-09-12

Abstract

Abstract: Existing unsupervised video summarization algorithms do not judge the importance of video frames accurately. To address this problem, an unsupervised video summarization algorithm based on deep and shallow feature fusion is proposed. Deep features of video frames are extracted by a Convolutional Neural Network (CNN); shallow features are extracted by the Speeded Up Robust Features (SURF) operator and then encoded with the Bag-of-Words (BoW) model. The deep and shallow features are fused into a richer feature descriptor, which serves as the input to the network model. A Bidirectional Long Short-Term Memory (BiLSTM) network models the temporal information and outputs frame importance scores, and the model is optimized using reinforcement learning. To generate static video summaries, a keyframe selection method based on local maxima is designed; it follows the temporal structure of the original video and avoids redundancy. In comparisons with several unsupervised video summarization algorithms on the SumMe and TVSum datasets, experimental results show that the proposed algorithm judges video content more accurately and generates higher-quality summaries.
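The abstract does not state how the two feature types are fused or how local maxima are detected, so the following is only a minimal sketch under stated assumptions: fusion is taken to be concatenation of L2-normalized per-frame vectors, and keyframes are taken to be local maxima of the frame importance scores. The names `fuse_features`, `select_keyframes`, and the `min_gap` parameter are hypothetical and do not come from the paper.

```python
import numpy as np


def fuse_features(deep, shallow):
    """Fuse per-frame deep and shallow descriptors.

    Assumption: fusion is concatenation of L2-normalized vectors.
    The paper's actual fusion operator may differ.
    """
    deep = np.asarray(deep, dtype=float)
    shallow = np.asarray(shallow, dtype=float)
    if deep.shape[0] != shallow.shape[0]:
        raise ValueError("deep and shallow features must cover the same frames")

    def l2_normalize(x):
        # Guard against division by zero for all-zero descriptors.
        norms = np.linalg.norm(x, axis=1, keepdims=True)
        return x / np.maximum(norms, 1e-12)

    return np.concatenate([l2_normalize(deep), l2_normalize(shallow)], axis=1)


def select_keyframes(scores, min_gap=1):
    """Return indices of local maxima of `scores`, in temporal order.

    Assumption: `min_gap` (hypothetical) is the minimum distance in frames
    between two selected keyframes; of two peaks closer than that, only the
    higher-scoring one is kept.
    """
    scores = np.asarray(scores, dtype=float)
    keyframes = []
    for i in range(len(scores)):
        left = scores[i - 1] if i > 0 else -np.inf
        right = scores[i + 1] if i < len(scores) - 1 else -np.inf
        # Strict on the left, non-strict on the right: a plateau yields
        # exactly one peak (its first frame).
        if scores[i] > left and scores[i] >= right:
            if keyframes and i - keyframes[-1] < min_gap:
                if scores[i] > scores[keyframes[-1]]:
                    keyframes[-1] = i
            else:
                keyframes.append(i)
    return keyframes


scores = [0.1, 0.5, 0.2, 0.3, 0.9, 0.4, 0.4, 0.1]
print(select_keyframes(scores))             # [1, 4]
print(select_keyframes(scores, min_gap=4))  # [4]
```

Because the selected indices are returned in ascending order, the resulting summary keeps the temporal order of the original video, and the spacing constraint is one simple way to limit redundancy between neighboring keyframes.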

Keywords: video summarization, feature fusion, bidirectional long short-term memory (BiLSTM) network, reinforcement learning, local maximum

ZENG Fan-feng, WANG Chun-zhen, LI Chen. An unsupervised video summarization algorithm based on deep and shallow feature fusion[J]. Computer Engineering & Science, 2023, 45(09): 1602-1610.

