• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

计算机工程与科学 (Computer Engineering & Science)

• Papers

Relationship prediction in complex networks with an improved naive Bayes model

WU Jie-hua (伍杰华)1,2, SHEN Jing (沈静)1, ZHOU Bei (周蓓)1

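  (1. Department of Computer Engineering, Guangdong College of Industry and Commerce, Guangzhou, Guangdong 510510; 2. School of Computer Science and Engineering, South China University of Technology, Guangzhou, Guangdong 510641)
  • Received: 2015-12-21 Revised: 2016-06-23 Online: 2017-10-25 Published: 2017-10-25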
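  • Supported by:

    Guangdong Education Research Special Project (GDJY-2014-B-B200); Guangdong Higher Vocational and Technical Education Research Association Project (GDGZ14Y037)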

An enhanced naive Bayesian relationship prediction model in complex networks

WU Jie-hua1,2, SHEN Jing1, ZHOU Bei1

  (1. Department of Computer Science and Engineering, Guangdong College of Industry and Commerce, Guangzhou 510510;
    2. School of Computer Science and Engineering, South China University of Technology, Guangzhou 510641, China)
  • Received: 2015-12-21 Revised: 2016-06-23 Online: 2017-10-25 Published: 2017-10-25

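Abstract (Chinese version, translated):

Complex networks include biological information networks, scientific collaboration networks, and social networks. Studying relationship prediction in complex networks helps predict protein interactions, discover collaborations among scientists, and uncover potential friendships. At present, the vast majority of relationship prediction algorithms are implemented with similarity models of complex networks, but such algorithms are built on explicit topological features of the network and ignore the latent information that influences how relationships form. To address this problem, an enhanced naive Bayes link prediction model (ELNB) is proposed on the basis of the naive Bayes link prediction model (LNB). By defining a common-neighbor relationship probability, the model captures the features of the local subgraph formed by the common neighbors, which effectively relaxes the independence assumption in LNB and quantifies the contribution of relationships among common neighbors. Experiments on synthetic datasets and real complex network datasets show that the proposed model outperforms the baseline algorithms and other recently proposed models. The idea behind ELNB is also extended to other similarity algorithms based on common neighbors, offering a new approach for research on this class of models.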

Keywords (Chinese version, translated): complex network, Bayesian model, relationship prediction, relationship mining

Abstract:

Complex networks include biological information networks, scientific collaboration networks, and social networks. Studying relationship prediction in complex networks helps predict interactions between proteins, discover collaborations among scientists, and mine potential friendships. Currently, most relationship prediction algorithms are implemented as similarity-based models. Because these algorithms are built on explicit network topology features, they ignore the latent information that influences how relationships are generated. To solve this problem, we propose an enhanced naive Bayesian relation prediction model (ELNB) that builds on the naive Bayesian link prediction model (LNB). ELNB defines a conditional probability to model the local sub-graph structure formed by common neighbors. This effectively alleviates the independence assumption of LNB and allows the contribution of relationships among common neighbors to be calculated quantitatively. Experiments on artificial and real datasets show that the proposed model outperforms the baselines and some recently proposed models. The idea of ELNB can also be extended to other similarity algorithms based on common neighbor nodes, which provides a new approach for research on this kind of model.
 

Key words: complex network; Bayesian model; relation prediction; relation mining
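
The abstract contrasts plain common-neighbor similarity with LNB-style scoring, in which each common neighbor contributes its own weight. The Python sketch below illustrates that contrast on a toy graph. It is not the authors' code: all function names are hypothetical, and the role ratio and prior-odds term follow the local naive Bayes form as it is usually stated in the link-prediction literature, which is an assumption here because the abstract gives no formulas. ELNB's common-neighbor relationship probability is not specified in the abstract, so it is deliberately not implemented.

```python
import math
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)}."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def common_neighbors(adj, x, y):
    """Nodes adjacent to both x and y."""
    return adj.get(x, set()) & adj.get(y, set())


def cn_score(adj, x, y):
    """Common-neighbor similarity: every shared neighbor counts equally."""
    return len(common_neighbors(adj, x, y))


def role(adj, w):
    """Assumed LNB role ratio of node w: (connected neighbor pairs + 1)
    divided by (disconnected neighbor pairs + 1)."""
    connected = sum(1 for a, b in combinations(adj[w], 2) if b in adj[a])
    total = len(adj[w]) * (len(adj[w]) - 1) // 2
    return (connected + 1) / (total - connected + 1)


def lnb_cn_score(adj, x, y):
    """Assumed LNB-CN score: sum over common neighbors w of
    log(s) + log(role(w)), where s is the prior odds of a non-link.
    Requires a graph that is not complete, so that s > 0."""
    n = len(adj)
    m = sum(len(nbrs) for nbrs in adj.values()) // 2
    s = (n * (n - 1) / 2) / m - 1
    return sum(math.log(s) + math.log(role(adj, w))
               for w in common_neighbors(adj, x, y))


if __name__ == "__main__":
    # Toy graph, invented for illustration only.
    toy_edges = [(1, 2), (1, 3), (2, 3), (3, 4), (2, 4), (4, 5)]
    toy_adj = build_adjacency(toy_edges)
    for pair in [(1, 4), (3, 5), (1, 5)]:
        print(pair, cn_score(toy_adj, *pair), lnb_cn_score(toy_adj, *pair))
```

Both functions rank candidate node pairs. The LNB-style variant differs from the plain count only in that a common neighbor whose own neighbors are densely linked contributes more than one whose neighbors are mostly unlinked.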