• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2010, Vol. 32 ›› Issue (1): 136-140.doi: 10.3969/j.issn.1007130X.2010.

• 论文 • Previous Articles     Next Articles

Research on SLMIR and Its Smoothing Techniques in Chinese QA Systems

  

  1. (Department of Computer Science,Hangzhou Dianzi University,Hangzhou 310018,China)
  • Received:2008-08-02 Revised:2008-11-18 Published:2010-01-18

Abstract:

In order to fit in with the Chinese language characteristics in the QA systems, this paper thoroughly analyzes the information retrieval model. After analyzing and comparing the traditional main IR models, we get a more efficiency IR method, which is SLMIR (an information retrieval method based on statistical language modeling). In addition, we study the best order number N in Ngram and its main data smoothing techniques, compare them by test results, and discusse the relevant factors which affect the data smoothing method,such as the scale of training. Finally, the best smoothing techniques in different conditions are given.

Key words: information retrieval;statistical language model;Ngram;SLMIR;smoothing technique

CLC Number: