ERC-KG：结合大语言模型的领域知识图谱构建方法

摘要/Abstract

摘要： 传统知识图谱构建主要依赖于数据预处理、实体识别、关系抽取以及实体对齐等技术手段，此类方法通常伴随高昂的计算与时间开销。针对这一问题，提出一种结合大语言模型抽取、检索和纠错的知识图谱构建方法，以优化知识图谱的生成效率与准确性。通过特征词抽取与领域专家知识相结合的方式精准确定知识图谱的实体集合，构建实体语料检索器，筛选与目标实体最相关的上下文语句作为大语言模型的输入。设计提示模板和验证反馈机制，实现高质量三元组抽取，并完成国防科技领域知识图谱构建。实验结果表明，图谱构建精确率达到94.32%，验证了方法的精确性与合理性，为领域知识图谱的快速构建贡献了新的研究思路。

关键词: 大语言模型, 知识图谱构建, 提示学习

Abstract: The construction of traditional knowledge graphs mainly relies on technical means such as data preprocessing, entity recognition, relation extraction, and entity alignment. Such methods are usually accompanied by high computational and time costs. To address this issue, a domain knowledge graph construction meth-od ERC-KG (Extraction Retrieval and Error Correction Knowledge Graph) is proposed, which combines large language models to optimize the efficiency and accuracy of knowledge graph generation. By combining feature word extraction with domain expert knowledge, the entity set of the knowledge graph is precisely determined. An entity corpus retriever is con-structed to select the context sentences most relevant to the target entity as the input of the large language model. A prompt template and verification feedback mechanism are designed to achieve high-quality triple extraction and complete the construction of the knowledge graph in the field of national defense science and technology. Experimental results show that the accuracy of the graph construction reaches 94.32%, verifying the accuracy and rationality of the method and also contributing new research ideas for the rapid construction of domain knowledge graphs.

Key words: large language model, knowledge graph construction, prompt learning

李相成, 汪永伟, 李强, 刘鹏程, 唐继鹏. ERC-KG：结合大语言模型的领域知识图谱构建方法[J]. 计算机工程与科学.

LI Xiangcheng, WANG Yongwei, LI Qiang, LIU Pengcheng, TANG Jipeng. ERC-KG: A Method for Constructing Domain Knowledge Graphs by Integrating Large Language Models[J]. Computer Engineering & Science.

[1]	田宇, 李军辉, 朱苏阳, 周国栋. 基于数据增强的对话情绪识别[J]. 计算机工程与科学, 2026, 48(2): 330-340.
[2]	付启航, 秦永彬, 黄瑞章, 周裕林, 胡青青. 基于多阶段协同推理的大语言模型司法问答框架[J]. 计算机工程与科学, 2026, 48(2): 268-276.
[3]	李鹤, 迟昊昂, 刘明宇, 杨文婧. 一种基于因果关系的减轻大语言模型幻觉的方法[J]. 计算机工程与科学, 2026, 48(2): 245-255.
[4]	高福财, 何廷年, 杨阳, 杨江伟. GPR:一种大语言模型增强的方法[J]. 计算机工程与科学, 2026, 48(1): 162-171.
[5]	徐春, 孙恩威, 汪晓洁. 基于知识和数据双驱动的DRG医疗问答研究[J]. 计算机工程与科学, 2025, 47(6): 1121-1132.
[6]	曾垂振1, 2, 崔良中1, 马文卓2. 基于ERNIE模型的雷达维修命名实体识别研究[J]. 计算机工程与科学, 2025, 47(6): 1106-1113.
[7]	陈宇灵, 李翔. 基于图结构提示实现低资源场景下的节点分类[J]. 计算机工程与科学, 2025, 47(3): 534-547.
[8]	唐晋韬, 张成贤, 鲍琛龙, 李文静. 基于大语言模型的面向领域的非连续命名实体识别[J]. 计算机工程与科学, 2025, 47(12): 2253-2260.
[9]	刘高, 徐建良, 张先轶, 刘贤冬. OpenLM：多平台高性能的大语言模型推理框架[J]. 计算机工程与科学, 2025, 47(12): 2129-2138.
[10]	裴炳森, 李欣, 樊志杰, 蒋章涛, 孙昊扬, 刘梓锐. 基于大语言模型的司法文本摘要研究[J]. 计算机工程与科学, 2025, 47(11): 2008-2018.

ERC-KG：结合大语言模型的领域知识图谱构建方法

ERC-KG: A Method for Constructing Domain Knowledge Graphs by Integrating Large Language Models

PDF

可视化

摘要/Abstract

引用本文

使用本文

相关文章 10

编辑推荐

Metrics

本文评价