• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2015, Vol. 37 ›› Issue (10): 1959-1964.

• 论文 • 上一篇    下一篇

一种基于局部加权回归的分类方法

徐晓丹1,2,刘华文2,姚明海1,刘日仙1   

  1. (1.浙江工业大学信息工程学院,浙江 杭州 310023;2.浙江师范大学数理与信息工程学院,浙江 金华 321004)
  • 收稿日期:2015-07-26 修回日期:2015-09-24 出版日期:2015-10-25 发布日期:2015-10-25
  • 基金资助:

    浙江省教育厅资助项目(Y201328291);浙江省自然科学基金资助项目(LZ14F030001,LY14F020012)

A novel classification method based on
locally weighted regression 

XU Xiaodan1,2,LIU Huawen2,YAO Minghai1,LIU Rixian1   

  1. (1.College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023;
    2.College of Mathematics,Physics and Information Engineering,Zhejiang Normal University,Jinhua 321004,China)(1.College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023;
    2.College of Mathematics,Physics and Information Engineering,Zhejiang Normal University,Jinhua 321004,China)
  • Received:2015-07-26 Revised:2015-09-24 Online:2015-10-25 Published:2015-10-25

摘要:

分类是数据挖掘和数据分析中最有应用价值的技术之一。传统的积极学习方法需要预先对模型空间进行假设,并且没有充分考虑到实例之间的相关性,其泛化能力将会受到一定程度的影响。针对上述问题,提出了一种基于新型映射关系的局部加权回归方法MLWR。该算法首先找出测试样本在训练集中的近邻样本,然后建立测试样本和近邻样本的回归函数,根据建立的回归模型和近邻样本的标签,计算得到测试样本的标签。实验与当前流行的多种分类方法在UCI的9个数据集上进行测试。实验结果表明我们的方法能有效地提高分类精度,对较大样本数据也有较好的适用性。

关键词: 分类, 映射关系, 局部加权回归, kNN, 懒惰学习

Abstract:

Classification is one of the most practical techniques in data mining and analysis. Existing classification algorithms based on eager learning require a model assumption and do not address the correlations between individual instances, such that their performance can be affected. In this paper, we propose a new learning method based on the locally weighted regression, called MLWR. For a given test example, the MLWR firstly identifies the neighboring instances in the training set, and a locally weighted regression model is generated from the test instance and its neighboring instances.Then the test label is calculated by using the regression model and the neighboring labels. In the experiments, five classification methods are tested on 9 data sets of UCI. Experiment results show that the performance of the MLWR is superior to other methods and also suitable for big data.

Key words: classification;mapping relationship;locally weighted regression;kNN;lazy learning