• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (06): 1013-1021.

• Computer Network and Znformation Security • Previous Articles     Next Articles

Copyright protection of open-sourced datasets based on invisible backdoor watermarking

HUANG Zhi-hui,XIAO Xiang-li,ZHANG Yu-shu,XUE Ming-fu   

  1. (College of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing  211106,China)
  • Received:2023-10-26 Revised:2023-12-01 Accepted:2024-06-25 Online:2024-06-25 Published:2024-06-17

Abstract: To address the copyright protection issue in the field of image classification datasets, a traceable method based on invisible backdoor watermarking, named IBWOD, is proposed. This method ensures the watermark’s strong concealment while maintaining good usability and effectiveness. Firstly, an encoder-decoder network is used to embed the backdoor watermark into selected samples, generating watermark samples. Secondly, the labels of these watermark samples are modified to specified labels, and then the watermark samples are merged with unmodified samples to form a watermark dataset. Models trained using this watermark dataset will leave a specific backdoor, i.e., a mapping relationship from the backdoor watermark to the specified labels. Finally, a corresponding model verification algorithm is proposed, based on this special mapping relationship, to verify if a suspicious model has used the watermark dataset. Experimental results demonstrate that IBWOD can effectively verify whether a model has used the watermark dataset and possesses strong concealment. 

Key words: open-sourced dataset, copyright protection, backdoor watermarking, machine learning, image classification