• A journal of the China Computer Federation
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science (计算机工程与科学)

• Artificial Intelligence and Data Mining

  • Funding:

    National Key R&D Program of China (2016YFB1000905); National Natural Science Foundation of China (6117013120); National 973 Program (2013CB329404); China Postdoctoral Science Foundation (2015M570837); Natural Science Foundation of Guangxi (2015GXNSFCB139011)

A feature selection algorithm based on kernel sparse representation

Lv Zhi-zheng,LI Yang-ding,LEI Cong   

  1. (College of Computer Science and Information Engineering, Guangxi Normal University, Guilin 541004, China)
  • Received: 2018-10-29 Revised: 2019-07-12 Online: 2020-01-25 Published: 2020-01-25


Abstract:

To address the “curse of dimensionality” that arises when classifying high-dimensional data, this paper proposes a new feature selection algorithm that combines kernel functions with sparse learning. Specifically, a kernel function first maps each feature dimension into a kernel space, where linear feature selection is performed; this realizes nonlinear feature selection in the original low-dimensional space. Secondly, sparse reconstruction is performed on the features mapped into the kernel space, yielding a sparse representation of the original dataset. Next, the L1-norm is used to build a feature scoring and selection mechanism that picks out the optimal feature subset. Finally, the data after feature selection are used in classification experiments. Experimental results on public datasets show that the proposed algorithm performs feature selection effectively and improves classification accuracy by about 3% over the competing algorithms.

Key words: feature selection, nonlinear, kernel function, sparse learning, L1-norm
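The pipeline described in the abstract (per-feature kernel mapping, L1-penalized sparse reconstruction, coefficient-based feature scoring) could be sketched roughly as follows. This is an illustrative sketch only, not the paper's actual formulation: the landmark-based RBF approximation of the kernel map, the ISTA solver for the L1 problem, and all parameter values (`n_landmarks`, `lam`, `gamma`) are assumptions made for the example.

```python
import numpy as np

def rbf_feature_map(x, centers, gamma=1.0):
    # Map a single feature column into kernel space via RBF similarities
    # to a few landmark points (a finite-dimensional approximation).
    return np.exp(-gamma * (x[:, None] - centers[None, :]) ** 2)

def kernel_sparse_feature_select(X, y, n_landmarks=5, lam=0.1, n_iter=500):
    """Score each original feature by the sparse weight mass its kernel
    block receives in an L1-penalized reconstruction of y."""
    n, d = X.shape
    blocks = []
    for j in range(d):
        centers = np.linspace(X[:, j].min(), X[:, j].max(), n_landmarks)
        blocks.append(rbf_feature_map(X[:, j], centers))
    Phi = np.hstack(blocks)  # n x (d * n_landmarks), one block per feature

    # L1-penalized least squares (Lasso) solved by ISTA:
    # minimize 0.5 * ||Phi w - y||^2 + lam * ||w||_1
    w = np.zeros(Phi.shape[1])
    step = 1.0 / (np.linalg.norm(Phi, 2) ** 2)  # 1 / Lipschitz constant
    for _ in range(n_iter):
        grad = Phi.T @ (Phi @ w - y)
        w = w - step * grad
        w = np.sign(w) * np.maximum(np.abs(w) - step * lam, 0.0)  # soft threshold

    # Score feature j by the L1 norm of its coefficient block.
    return np.array([np.abs(w[j * n_landmarks:(j + 1) * n_landmarks]).sum()
                     for j in range(d)])

# Toy usage: feature 0 drives y nonlinearly, feature 1 is pure noise,
# so feature 0 should receive a much higher score.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 2))
y = np.sin(3 * X[:, 0])
scores = kernel_sparse_feature_select(X, y)
```

Because the relevance of feature 0 is nonlinear (a sine), a purely linear selector could miss it; mapping each feature through a kernel first is what lets the linear L1 machinery detect it.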