• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Selecting attributes and granularity
for data with test cost constraint
 

LIAO Shujiao1,2,ZHU Qingxin1,LIANG Rui 1   

  1. (1.School of Information and Software Engineering,University of Electronic Science and Technology of China,Chengdu 610054;
    2.School of Mathematics and Statistics,Minnan Normal University,Zhangzhou 363000,China)
     
  • Received:2017-03-20 Revised:2017-05-26 Online:2018-08-25 Published:2018-08-25

Abstract:

Test cost and misclassification cost are commonly considered in costsensitive learning. In real applications, the test cost of a feature is often related to the granularity of attribute values, and the misclassification cost of an object with multiple attributes is usually influenced by the total test cost of attributes. Based on this consideration, the paper studies the selection of attribute and granularity of data in the case where the total test cost is restrained. Aiming at minimizing the average total cost of data processing, a method is proposed to choose the optimal attribute subset and the optimal data granularity simultaneously. We first construct the theoretical model of the proposed method, and then design an efficient algorithm. Experimental results show that the proposed algorithm can effectively select the attributes and the granularity of data under different test cost constraints.
 

Key words: cost, error, neighborhood, attribute selection, granularity selection