J4 ›› 2011, Vol. 33 ›› Issue (12): 130-135.
• 论文 • Previous Articles Next Articles
HUANG Dong
Received:
Revised:
Online:
Published:
Abstract:
Discretization of continuous data is an important preprocess of classification methods in data mining. This paper presents a balanced discretization algorithm based on the minimum description length principle. It well measures the relationship between the discretized interval and classification errors by proposing a balanced discretization function based on the minimum description length. The approach proposes an effective heuristic discretization algorithm with the aim to find the optimal breakpoint sequence. The simulation results demonstrate that the proposed algorithm achieves more classification and learning ability on the C5.0 decision tree and the naive Bayesian classifier.
Key words: discretization;data mining;minimum description length(MDL);balanced function
HUANG Dong. Bad:A Balanced Discretization Algorithm Based on the Minimum Description Length[J]. J4, 2011, 33(12): 130-135.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2011/V33/I12/130