Graph Prompt Few-Shot Node Classification Based on Preserving Node Cluster Distribution

Abstract

In graph mining tasks, prototype-based graph prompt learning is widely regarded as an effective way to improve the performance of graph data analysis. In few-shot node classification, however, existing methods have two shortcomings: they underuse unlabeled data, which leads to inaccurate class prototypes, and they make insufficient use of graph topology. Both limit the effectiveness of graph prompt learning on downstream tasks. To address this, a graph prompt learning method, PNCD-GP, is proposed that integrates the cluster distribution of all nodes, aiming to improve performance and accuracy by fully exploiting the cluster distribution and topological structure of unlabeled data. In the pre-training stage, mask prediction and preservation of graph node clustering are adopted as optimization objectives, so that the model learns discriminative representations and the gap between upstream and downstream tasks is narrowed. In the prompt fine-tuning stage, class-prototype virtual nodes are introduced into the original graph as prompts, and high-order information is used to enhance the topological structure, improving the model's understanding and use of the graph structure; the prompts are learned by preserving the cluster distribution of unlabeled samples and labeled nodes. The method can construct more accurate prototype vectors and classifies nodes by the similarity between class prototypes and node representations. Experimental results on multiple public graph datasets show that PNCD-GP has significant advantages in both efficiency and accuracy.

Keywords: graph mining, graph prompt learning, graph neural network, few-shot, clustering
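The final classification step described in the abstract, assigning each node to the class whose prototype is most similar to its representation, can be illustrated with a minimal sketch. This is not the PNCD-GP implementation. It assumes that node embeddings are already available, that each prototype is simply the mean of the labeled embeddings of its class, and that similarity is cosine similarity. PNCD-GP instead learns prototypes as virtual-node prompts using the cluster distribution of unlabeled nodes, which is not shown here. All names and embedding values below are hypothetical.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_prototypes(embeddings, labels):
    """Assumed simplification: prototype = mean embedding of labeled nodes per class."""
    by_class = {}
    for node, cls in labels.items():
        by_class.setdefault(cls, []).append(embeddings[node])
    return {cls: mean_vector(vs) for cls, vs in by_class.items()}


def classify(embedding, prototypes):
    """Assign the class whose prototype is most similar to the embedding."""
    return max(prototypes, key=lambda cls: cosine(embedding, prototypes[cls]))


# Toy 2-D embeddings (made-up values), with one labeled node per class (1-shot).
embeddings = {
    "a": [1.0, 0.1], "b": [0.9, 0.2],
    "c": [0.1, 1.0], "d": [0.2, 0.8],
    "e": [0.8, 0.3],
}
labels = {"a": 0, "c": 1}

prototypes = build_prototypes(embeddings, labels)
predictions = {n: classify(embeddings[n], prototypes) for n in ("b", "d", "e")}
print(predictions)
```

With only one labeled node per class, each prototype in this sketch equals a single embedding, which shows why prototypes built from labeled data alone can be unreliable in the few-shot setting and why the abstract stresses the use of unlabeled nodes.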

XIE Qiuyuan, LI Qiuyao, CHAI Bianfang. Graph prompt few-shot node classification based on preserving node cluster distribution[J]. Computer Engineering & Science.

