J4, 2011, Vol. 33, Issue (1): 143-149. doi: 10.3969/j.issn.1007130X.2011.
• Papers •
WANG Junze, HUANG Benxiong, HU Guang, WEN Jie
Abstract:
In community-based Q&A (cQA) services such as Baidu Zhidao, question classification is a crucial task, since it is needed to organize the questions submitted to the cQA system. A question categorization algorithm for a cQA service needs high accuracy, low computational cost, and low sensitivity to noise. Building on the Kullback-Leibler distance (KLD) classification algorithm, this paper introduces a new question classification approach, named n-gram KLD, that adopts the idea of a language model. Experiments on a large corpus containing more than 1 million question-answer pairs show a significant improvement when the n-gram KLD algorithm is used, and indicate that the algorithm meets the practical demands of the question classification task in cQA services.
Key words: short text classification; Kullback-Leibler distance; language model
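
For readers unfamiliar with the general idea, the following is a minimal sketch of Kullback-Leibler distance classification over n-gram language models. It is not the authors' n-gram KLD implementation: the abstract does not state the n-gram order, the smoothing scheme, or the tokenization, so the character bigrams, the add-alpha smoothing, and every function name and training example here are assumptions chosen for illustration. Each category gets one n-gram count table, and a question is assigned to the category whose smoothed distribution is closest to the question's own n-gram distribution.

```python
import math
from collections import Counter


def char_ngrams(text, n=2):
    """Return the overlapping character n-grams of a lower-cased text."""
    text = text.lower()
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def train(labeled_questions, n=2):
    """Build one n-gram count table per category (hypothetical helper)."""
    models = {}
    for category, question in labeled_questions:
        models.setdefault(category, Counter()).update(char_ngrams(question, n))
    return models


def kl_distance(question_counts, category_counts, vocab_size, alpha=1.0):
    """KL(P_question || P_category).

    The question distribution is the raw relative frequency; the category
    distribution uses add-alpha smoothing (an assumption) so that unseen
    n-grams never get zero probability.
    """
    q_total = sum(question_counts.values())
    c_total = sum(category_counts.values())
    distance = 0.0
    for gram, count in question_counts.items():
        p = count / q_total
        q = (category_counts[gram] + alpha) / (c_total + alpha * vocab_size)
        distance += p * math.log(p / q)
    return distance


def classify(question, models, n=2):
    """Return the category whose model is closest to the question."""
    question_counts = Counter(char_ngrams(question, n))
    if not question_counts:
        raise ValueError("question is shorter than n characters")
    # Shared vocabulary: every n-gram seen in training or in the question.
    vocab = set(question_counts)
    for counts in models.values():
        vocab.update(counts)
    return min(
        models,
        key=lambda c: kl_distance(question_counts, models[c], len(vocab)),
    )


# Toy training data, invented for this sketch.
training = [
    ("computers", "how do i install python on windows"),
    ("computers", "why does my laptop keep crashing"),
    ("health", "how to cure a sore throat quickly"),
    ("health", "what causes a headache after sleeping"),
]
models = train(training)
```

Calling `classify("my laptop is crashing on windows", models)` selects the category with the smallest distance. Character n-grams are used here only because they need no word segmentation; a word-level or higher-order model would follow the same structure.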
WANG Junze, HUANG Benxiong, HU Guang, WEN Jie. A Study of the Question Classification Task in Community-Based Q&A Services[J]. J4, 2011, 33(1): 143-149.
URL: http://joces.nudt.edu.cn/EN/10.3969/j.issn.1007130X.2011.
http://joces.nudt.edu.cn/EN/Y2011/V33/I1/143