Copyright protection of open-sourced datasets based on invisible backdoor watermarking

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (6): 1013-1021.

• Computer Network and Information Security •

HUANG Zhi-hui, XIAO Xiang-li, ZHANG Yu-shu, XUE Ming-fu

(College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China)

Received: 2023-10-26 Revised: 2023-12-01 Online: 2024-06-25 Published: 2024-06-17

Abstract

To address copyright protection for image classification datasets, a traceable method based on invisible backdoor watermarking, named IBWOD, is proposed. The method keeps the watermark strongly concealed while maintaining good usability and effectiveness. First, an encoder-decoder network embeds the backdoor watermark into selected samples, producing watermark samples. Second, the labels of these watermark samples are changed to the specified labels, and the watermark samples are merged with the unmodified samples to form a watermark dataset. A model trained on this watermark dataset acquires a specific backdoor, i.e., a mapping from the backdoor watermark to the specified labels. Finally, a model verification algorithm based on this mapping is proposed to determine whether a suspicious model was trained on the watermark dataset. Experimental results show that IBWOD can effectively verify whether a model has used the watermark dataset and that the watermark is strongly concealed.
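The three steps described in the abstract (embed, relabel and merge, verify) can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the IBWOD implementation: the abstract does not give the encoder-decoder architecture or the verification algorithm, so a faint additive pattern stands in for the encoder network, a single target label stands in for the specified labels, and a plain hit-rate threshold stands in for the verification algorithm. The names `embed_watermark`, `build_watermark_dataset`, `verify_model`, `strength`, `rate`, and `threshold` are all hypothetical.

```python
import random
from typing import Callable, List, Sequence, Tuple

Image = List[float]          # flattened image, pixel values in [0, 1]
Sample = Tuple[Image, int]   # (image, label)


def embed_watermark(image: Sequence[float], pattern: Sequence[float],
                    strength: float = 0.02) -> Image:
    """Stand-in for the encoder network (assumption): add a faint fixed
    pattern to the image and clip the result to [0, 1]."""
    return [min(1.0, max(0.0, p + strength * w)) for p, w in zip(image, pattern)]


def build_watermark_dataset(dataset: Sequence[Sample], pattern: Sequence[float],
                            target_label: int, rate: float = 0.1,
                            seed: int = 0) -> List[Sample]:
    """Watermark a random fraction of the samples, relabel them to
    target_label, and keep all other samples unchanged."""
    rng = random.Random(seed)
    n_marked = int(len(dataset) * rate)
    marked = set(rng.sample(range(len(dataset)), n_marked))
    result: List[Sample] = []
    for i, (image, label) in enumerate(dataset):
        if i in marked:
            result.append((embed_watermark(image, pattern), target_label))
        else:
            result.append((list(image), label))
    return result


def verify_model(model: Callable[[Image], int], probes: Sequence[Image],
                 pattern: Sequence[float], target_label: int,
                 threshold: float = 0.8) -> bool:
    """Simplified check (assumption): watermark the probe images and report
    whether the suspicious model maps at least `threshold` of them to
    target_label."""
    if not probes:
        raise ValueError("at least one probe image is required")
    hits = sum(model(embed_watermark(img, pattern)) == target_label
               for img in probes)
    return hits / len(probes) >= threshold
```

In this sketch, `model` is any callable that maps an image to a predicted label, and the probe images would normally be clean images whose true label differs from `target_label`, so that a high hit rate is unlikely to occur by chance in a model that never saw the watermark dataset.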

Key words: open-sourced dataset, copyright protection, backdoor watermarking, machine learning, image classification

HUANG Zhi-hui, XIAO Xiang-li, ZHANG Yu-shu, XUE Ming-fu. Copyright protection of open-sourced datasets based on invisible backdoor watermarking[J]. Computer Engineering & Science, 2024, 46(6): 1013-1021.
