• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (05): 977-985.

• 论文 • Previous Articles     Next Articles

Ensemble classification
based on feature drifting in data streams          

ZHANG Yupei,LIU Shuhui   

  1. (School of Information Engineering,Zhengzhou University,Zhengzhou 450052,China)
  • Received:2012-12-17 Revised:2013-05-18 Online:2014-05-25 Published:2014-05-25

Abstract:

In order to construct an effective classifier for data streams with concept drifting,according to the theory that different data feature has different critical degree for classification,a method of Ensemble Classifier for Feature Drifting in data streams (ECFD) is proposed. Firstly,the definite of feature drifting and the relationship between feature drifting and concept drifting is given.Secondly,mutual information theory is used to propose an Unsupervised Feature Filter (UFF) technique,so that critical feature subsets are extracted to detect feature drifting.Finally, the basic classified algorithms with the capability of handling concept drifting is chosen to construct heterogeneous ensemble classifier on the basis of critical feature subsets. This method exhibits a new idea of way to highdimensional data streams with hidden concept drifting.Experimental results show that the method has strong appearance in accuracy, speed and scalability, especially for highdimensional data streams.
          

Key words: feature selection;feature drifting;concept drifting;data stream;mutual information;ensemble classifier