• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2016, Vol. 38 ›› Issue (05): 988-996.

• 论文 • Previous Articles     Next Articles

A multiphased improvement for time series
classification based on symbolic aggregation
approximation representation  

SONG Wei1,ZHANG Fan2,YE Yangdong1,HAN Peng3,FAN Ming1   

  1. (1.School of Information Engineering,Zhengzhou University,Zhengzhou 450001;
    2.School of Information Engineering,North China University of Water Resources and Electric Power,Zhengzhou 450045;
    3.State Administration of Taxation,Henan Provincial Office,Zhengzhou 450000,China)
  • Received:2015-11-26 Revised:2016-01-30 Online:2016-05-25 Published:2016-05-25

Abstract:

Classification is one of the basic tasks in data mining, and feature representation and similarity measurement act as the important basis of time series data mining. The symbolic aggregate approximation (SAX) is a typical symbolic representation method which is straightforward and very simple, and which can efficiently converts time series data to a symbolic representation with dimensionality/ noise reduction. But the potential of information loss can affect the accuracy of the classification results. Focusing on the SAX discretization method coupled with the bag of patterns (BOP) representation in classification task, we propose a multiphased approach framework using the AdaBoost algorithm and voting in ensemble learning   to remedy the information loss of the SAX representation. Experimental results show that the proposed method can improve classification accuracy greatly.

Key words: time series;SAX;classification;ensemble learning;multiphased