• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2023, Vol. 45 ›› Issue (03): 554-564.

• 人工智能与数据挖掘 • 上一篇    下一篇

基于表示学习的实体对齐方法综述

马赫,王海荣,周北京,孙崇,徐玺   

  1. (北方民族大学计算机科学与工程学院,宁夏 银川 750021)
  • 收稿日期:2022-09-19 修回日期:2022-10-26 接受日期:2023-03-25 出版日期:2023-03-25 发布日期:2023-03-23
  • 基金资助:
    北方民族大学校级科研项目(2021XYZJK06)

Overview of the entity alignment methods based representation learning

MA He,WANG Hai-rong,ZHOU Bei-jing,SUN Chong,XU Xi   

  1. (School of Computer Science and Engineering,North Minzu University,Yinchuan 750021,China)
  • Received:2022-09-19 Revised:2022-10-26 Accepted:2023-03-25 Online:2023-03-25 Published:2023-03-23

摘要: 实体对齐是目前知识融合阶段的主要工作之一,基于表示学习的方法是实体对齐的主要研究方向。首先,通过全面地研究当前代表性的实体对齐技术,总结出这些技术的特征及架构,并提出了一个捕捉这些技术关键特征的框架;然后根据这些技术使用的知识表示模型将其分成2类:基于Trans的技术和基于GNN的技术;给出了2个当前广泛使用的数据集,搭建了11个有代表性的基于TransE的模型和基于GNN的模型,并在DBP15K上的3个跨语言数据集上进行对比实验;评测主流模型和添加属性或字面等不同侧面信息后的模型的对齐效果,为未来大规模单模态乃至多模态知识图谱实体对齐研究提供参考。

关键词: 知识图谱;实体对齐;知识表示;侧面信息;相似性 ,  ,   

Abstract: Entity alignment is one of the main tasks in the stage of knowledge fusion. Representation learning is the main research direction of entity alignment. Firstly, after a thorough study of the current representative entity alignment techniques, the characteristics and architecture of these methods are described, and a framework to capture the key features of these techniques is proposed. Then, based on the knowledge representation technologies they use, they are divided into two categories: Trans-based and GNN-based. Two currently widely used datasets are summarized, and 11 representative models of the above two categories are built. These models run on three datasets of the DBP15k cross-language dataset in the comparative experiments. Finally, this paper evaluates the alignment effect of mainstream models and models with different side information such as attributes and words, and provides a reference for future large-scale single-mode and even multi-modal knowledge map entity alignment studies.

Key words: knowledge graph, entity alignment, knowledge representation, profile information, similarity