• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2016, Vol. 38 ›› Issue (07): 1330-1337.

• 论文 • Previous Articles     Next Articles

130X.2016.07.005An ensemble feature selection algorithm
for high dimensional microarray data 

SUN Gang1,2,ZHANG Jing1,3   

  1. (1.School of Computer and Information,Hefei University of Technology,Hefei 230009;
    2.School of Computer and Information Engineering,Fuyang Teachers College,Fuyang 236037;
    3.State Grid Anhui Information and Telecommunication Company,Hefei 230061,China)
  • Received:2015-05-25 Revised:2015-09-01 Online:2016-07-25 Published:2016-07-25

Abstract:

Feature selection algorithms are an important tool for microarray data analysis, thus their classification ability and stability are essential for data analysis. We propose an ensemble feature selection algorithm for high dimensional microarray data to compensate for the lack of information on a single gene subset. We firstly adopt the signal noise ratio method to select discriminative genes, and then generate relevant gene subsets by evaluating the correlation between the candidate gene and discriminative gene through conditional correlation coefficients. We finally integrate resemblant gene subsets through the ensemble learning technology. Experimental results show that in most cases the classification ability and stability of the proposed algorithm is superior to those that select only a single gene subset.

Key words: microarray data;signal noise ratio;conditional correlation coefficient;feature selection