J4 ›› 2011, Vol. 33 ›› Issue (9): 130-135.
• 论文 • Previous Articles Next Articles
GU Qiong,YUAN Lei,NING Bin,XIONG Qijun,HUA Li,LI Wenxin
Received:
Revised:
Online:
Published:
Abstract:
Most studies on the imbalanced data set classification focus on the discussion of resampling or costsensitive learning systems themselves; however, the fact that the costs of imbalanced class distribution and unequal misclassification errors always occur simultaneously is neglected. We propose a novel cost sensitive learning (CSL) algorithm which combines the methods of resampling and the CSL techniques together in order to solve the misclassification problem of imbalanced data set. On one hand, the resampling technique allows the balanced data sets by reconstructing both the majority and the minority class. On the other hand, the classification is performed based on the minimal misclassification cost but not the maximal accuracy. Here the misclassification cost for the minority class is much higher than the misclassification cost for the majority class. A costsensitive learning procedure is then conducted for classification. The experimental results show that the proposed method can improve the classification accuracy and decrease the misclassification cost effectively, and the algorithm is superior to the traditional algorithms as for dealing with the imbalanced problem.
Key words: classification;imbalanced dataset;hybrid resampling;cost sensitive learning
GU Qiong,YUAN Lei,NING Bin,XIONG Qijun,HUA Li,LI Wenxin. A Novel Cost Sensitive Learning Algorithm Based on Resampling[J]. J4, 2011, 33(9): 130-135.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2011/V33/I9/130