基于深度学习的草图检索方法研究进展

计算机工程与科学 ›› 2021, Vol. 43 ›› Issue (12): 2190-2205.

基于深度学习的草图检索方法研究进展

姬子恒，王斌

（南京财经大学信息工程学院，江苏南京210023）

收稿日期:2020-09-03 修回日期:2020-11-04 出版日期:2021-12-25 发布日期:2021-12-31
基金资助:
国家自然科学基金(61372158,61876037);江苏省自然科学基金(BK20181414);江苏省高校优秀科技创新团队项目(2017-15);江苏省高校自然科学研究重大项目 (18KJA52004);江苏省研究生科研与实践创新计划(KYCX19_1358)

Research progress on deep learning based sketch retrieval

JI Zi-heng,WANG Bin

（School of Information Engineering,Nanjing University of Finance and Economics,Nanjing 210023,China）

Received:2020-09-03 Revised:2020-11-04 Online:2021-12-25 Published:2021-12-31

摘要/Abstract

摘要： 草图检索(SBIR)是基于内容的图像检索(CBIR)的扩展，是一种灵活便捷的目标图像检索方式，其研究的焦点是如何减少手绘草图域与自然图像域之间的域差。传统方法提取手工特征完成草图域与图像域之间的近似转换以减少域差，但该类方法无法有效拟合2个域内容，导致检索精度不高。深度学习方法依赖大量数据进行图像高维特征的提取，突破了传统方法的局限，已被证明可以有效解决跨域建模问题。研究聚焦于基于深度学习的草图检索方法，在深度特征提取模型、公开的数据测试集、粗粒度和细粒度检索、哈希技术和类别泛化等几个方面对草图检索的深度学习方法的相关研究工作进行了综述和评论。然后进行了实验比较研究，一方面，对现有3个公开的SBIR测试集Sketchy、TU-Berlin和QuickDraw进行适用性评估;另一方面，选取3个最新的SBIR深度学习模型GRLZS模型、SEM-PCYC模型和SAKE模型进行性能分析与比较。最后，对草图检索面临的挑战和未来研究方向进行了总结与展望。

关键词: 深度学习, 草图检索, 特征提取, 基于内容的图像检索

Abstract: Sketch retrieval (SBIR) is an extension of content-based image retrieval (CBIR), which is a flexible and convenient way to retrieve target images. How to minimize the difference between the sketch domain and the image domain is crucial to SBIR. The traditional methods extract the manual features to achieve the approximate conversion between the sketch field and the image field, so as to reduce the domain difference. However, these methods cannot effectively fit the content of the two domains, resulting in low retrieval accuracy. Deep learning methods break through the limitations of traditional methods, which extract high-dimensional features from a large amount of data and have been proved to effectively solve the cross-domain modeling problems. This paper focuses on deep learning-based sketch retrieval methods, and covers several aspects such as the deep feature extraction model, public dataset, coarse-grained and fine-grained retrieval based on deep learning, deep hashing technology, category generalization, etc. Related works are reviewed and commented on. Then, a comparative experiment is conducted. For one hand, three existing public SBIR datasets such as Sketchy, TU-Berlin and QuickDraw are used for suitability evaluation. For the other hand, three latest SBIR deep learning models such as GRLZS model, SEM-PCYC model and SAKE model are selected for performance analysis and comparison. Finally, current challenges and future research trends of SBIR are summarized.

Key words: deep learning, sketch retrieval, feature extraction, content-based image retrieval

姬子恒, 王斌. 基于深度学习的草图检索方法研究进展[J]. 计算机工程与科学, 2021, 43(12): 2190-2205.

JI Zi-heng, WANG Bin. Research progress on deep learning based sketch retrieval[J]. Computer Engineering & Science, 2021, 43(12): 2190-2205.

[1]	于致远, 宋慧慧, . 用于遥感图像时空融合的多尺度全聚合网络[J]. 计算机工程与科学, 2025, 47(05): 864-874.
[2]	尹春勇, 张小虎. 基于Transformer和Text-CNN的日志异常检测[J]. 计算机工程与科学, 2025, 47(03): 448-458.
[3]	徐雯, 于瓅. 基于迭代收缩阈值与深度学习的压缩感知图像重构网络[J]. 计算机工程与科学, 2025, 47(03): 485-493.
[4]	王宇飞, 刘强, 张唯贞, 伍晓洁, 李佳雯, 王煜恒. rtTorTIM：基于多模态特征融合和Stacking集成学习的实时Tor流量识别方法#br#[J]. 计算机工程与科学, 2025, 47(02): 238-246.
[5]	刘拥民, 许成, 黄浩, 张钱垒, 赵俊杰, . 基于SAE和WGAN的入侵检测方法研究[J]. 计算机工程与科学, 2025, 47(02): 256-264.
[6]	许天佑, 高光勇. 基于可逆生成对抗网络的鲁棒图像隐藏[J]. 计算机工程与科学, 2025, 47(02): 288-297.
[7]	吴玉虹, 王建. 基于Patches-CNN的模拟电路故障诊断[J]. 计算机工程与科学, 2025, 47(01): 35-44.
[8]	徐超, 阮荣耀, 陈勇, . 一种基于区块链的医疗数据审计方法[J]. 计算机工程与科学, 2025, 47(01): 95-106.
[9]	陈欣然, 刘宁, 闫中敏, 刘磊, 崔立真. 基于注意力指导的双粒度跨模态医学特征学习框架[J]. 计算机工程与科学, 2025, 47(01): 150-159.
[10]	罗婧, 叶志晟, 杨泽华, 傅天豪, 魏雄, 汪小林, 罗英伟, . 研发类GPU集群任务数据集的构建及分析[J]. 计算机工程与科学, 2024, 46(12): 2128-2137.
[11]	敬超, 闭玉申. 面向深度学习作业的干扰感知在线调度算法研究[J]. 计算机工程与科学, 2024, 46(12): 2138-2148.
[12]	陈磊, 梁正友, 孙宇, 蔡俊民. 多尺度特征融合的移动端单目深度估计研究[J]. 计算机工程与科学, 2024, 46(09): 1616-1524.
[13]	刘强, 李沐春, 伍晓洁, 王煜恒. S-JSMA：一种低扰动冗余的快速JSMA对抗样本生成方法[J]. 计算机工程与科学, 2024, 46(08): 1395-1402.
[14]	徐捷, 邵玉斌, 杜庆治, 龙华, 马迪南. 结合混合特征提取与深度学习的长文本语义相似度计算[J]. 计算机工程与科学, 2024, 46(08): 1513-1520.
[15]	丁建平, 李卫军, 刘雪洋, 陈旭. 命名实体识别研究综述[J]. 计算机工程与科学, 2024, 46(07): 1296-1310.