• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

A hierarchical clustering based feature
selection algorithm for ranking learning
 

MENG Yu-yu,CHEN Shao-li,LIU Xing-chang   

  1. (School of Electronic and Information Engineering,Lanzhou Jiaotong University,Lanzhou 730070,China)
  • Received:2018-06-11 Revised:2018-11-15 Online:2019-12-25 Published:2019-12-25

Abstract:

Large search systems are especially necessary for quick response to user queries. At the same time, strict backend delay constraints must be observed when calculating the feature relevance of candidate documents. Feature selection can improve the machine learning efficiency. Considering the characteristics that most of the initial points of fast feature selection in ranking learning start from the single feature, which has the best ranking effect, this paper first proposes an algorithm of generating initial points of fast feature selection by hierarchical clustering, and applies the algorithm to two existing fast feature selection algorithms. In addition, a new method that makes full use of clustering features is proposed to deal with feature selection. Experiments on two standard datasets show that the proposed algorithm can obtain a smaller feature subset without affecting the accuracy and obtain the best ranking accuracy on a medium subset.
 

Key words: feature selection, ranking learning, hierarchical clustering, greedy search algorithm