• A journal of the China Computer Federation
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (3): 500-511.

• Graphics and Images •

An adversarial examples defense method for image reconstruction based on SCViT

ZHANG Xinjun, GUO Jifa

  1. (School of Electronic and Information Engineering, Liaoning Technical University, Huludao 125105, China)
  • Received: 2024-05-21  Revised: 2024-09-13  Online: 2026-03-25  Published: 2026-03-25

Abstract: The rapid development of artificial intelligence (AI) has brought great convenience to people's lives, but it has also raised growing concerns about its security. Image classification is a crucial task in computer vision; however, the vulnerability of deep neural networks makes them susceptible to attacks from adversarial examples. Adversarial examples are a significant research direction in AI security, and numerous techniques have emerged for both generating and defending against them. This paper introduces modifications to the vision Transformer (ViT) and proposes a novel model, the similarity comparison vision Transformer (SCViT), for comparing the similarity of image patches. In SCViT, image patches are passed through a linear projection layer and a Transformer encoder to obtain corresponding representation vectors, and the cosine similarity between these vectors is then computed to measure the degree of similarity between image patches. To mitigate the influence of positional encoding on the similarity computation, a small coefficient, denoted α, is introduced before the positional encoding in SCViT. Using SCViT for patch similarity comparison, clean sample patches replace adversarial sample patches one by one, and all of the replaced clean patches are then stitched together to form a new image for classification. Experimental results on the CIFAR-10 dataset demonstrate that selecting an appropriate value of α can enhance the defensive performance of the proposed method, and experiments on the Inception_v3 and Inception_v4 classification models indicate that the method transfers well across different classification networks. Compared with several commonly used image reconstruction defense methods, the proposed method not only achieves superior defensive performance but also demonstrates greater robustness, with image classification accuracy exceeding 80% against four types of attack methods. Additionally, experiments on the CIFAR-100 and ImageNet datasets show that classification accuracy on adversarial examples improves by more than 54 and 46 percentage points, respectively, highlighting the versatility of the proposed method.
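The defense pipeline described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' SCViT implementation: the linear projection `W`, the positional encoding `pos_enc`, the clean-patch bank, and all shapes are hypothetical stand-ins, and the full Transformer encoder is reduced to a single projection. The sketch shows only the mechanism the abstract names: embed patches with an α-scaled positional encoding, compare embeddings by cosine similarity, and replace each adversarial patch with its most similar clean patch before reassembly.

```python
import numpy as np

def cosine_similarity(u, v):
    # cosine similarity between two representation vectors
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def embed_patches(patches, W, pos_enc, alpha=0.1):
    # Stand-in for SCViT's linear projection + Transformer encoder.
    # The positional encoding is scaled by a small coefficient alpha,
    # mirroring the paper's idea of damping position's influence on
    # the similarity score.
    flat = patches.reshape(patches.shape[0], -1)
    return flat @ W + alpha * pos_enc

def reconstruct(adv_patches, clean_bank, W, pos_enc, alpha=0.1):
    # Replace each (possibly adversarial) patch with the most similar
    # clean patch from the bank, then return the replaced patches,
    # ready to be stitched back into an image for classification.
    adv_emb = embed_patches(adv_patches, W, pos_enc, alpha)
    bank_emb = clean_bank.reshape(clean_bank.shape[0], -1) @ W
    out = np.empty_like(adv_patches)
    for i, e in enumerate(adv_emb):
        sims = [cosine_similarity(e, b) for b in bank_emb]
        out[i] = clean_bank[int(np.argmax(sims))]
    return out
```

With α = 0 the positional term vanishes and an unperturbed patch always matches itself in the bank; a nonzero α trades off content similarity against positional agreement, which is why the abstract reports that the choice of α affects defensive performance.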


Key words: image classification, adversarial example, image stitching, vision Transformer, Poisson fusion