• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2013, Vol. 35 ›› Issue (3): 150-154.

• 论文 • 上一篇    下一篇

基于改进互信息的信息检索扩展模型

涂伟1,甘丽新2,黄乐辉1,谢志华2   

  1. (1.江西科技师范大学文科综合实验中心,江西 南昌 330038;2.江西科技师范大学光电子与通信重点试验室,江西 南昌 330038)
  • 收稿日期:2011-08-25 修回日期:2012-07-10 出版日期:2013-03-25 发布日期:2013-03-25
  • 基金资助:

    江西省教育厅科技资助项目(GJJ11224,GJJ11225);江西省科技支撑计划资助项目(00029511101228076)

Expanded information retrieval
model based on improved mutual information  

TU Wei1,GAN Lixin2,HUANG Lehui1,XIE Zhihua2   

  1. (1.Center of Arts Complex Laboratory,Jiangxi Science and  Technology Normal University,Nanchang 330038;
    2.Key Laboratory of OpticElectronic and Communication,
    Jiangxi Science and  Technology Normal University,Nanchang 330038,China)
  • Received:2011-08-25 Revised:2012-07-10 Online:2013-03-25 Published:2013-03-25

摘要:

互信息已广泛应用于信息检索扩展模型中。针对互信息存在倾向于低频词、忽略稀疏数据可能导致负相关的潜在影响的问题,本文将改进的互信息方法应用于信息检索扩展模型中。在五个标准数据集上的实验结果表明,本文提出的基于改进互信息的信息检索扩展模型比基于传统互信息的查询扩展模型具有更优的检索性能。

关键词: 查询扩展, 互信息, 信息检索

Abstract:

Mutual Information has been widely applied to many expanded information retrieval models.Aiming at problems in mutual information,for instance,being apt to lowfrequency words and ignoring negative potential impact led by sparse data,this paper applies improved mutual information in an expanded information retrieval model. Experimental results on the five normal datasets show that the expanded information retrieval model based on improved mutual information outperforms that based on traditional mutual information.

Key words: query expansion;mutual information;information retrieval