• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (08): 1467-1473.

• Artificial Intelligence and Data Mining • Previous Articles     Next Articles

Text feature selection based on sine and cosine algorithm

WEN Wu1,2,3,WAN Yu-hui1,2,WEN Zhi-yun1,2   

  1. (1.School of Communication and Information Engineering,
    Chongqing University of Posts and Telecommunications,Chongqing 400065;
    2.Research Center of New Telecommunication Technology Applications,
    Chongqing University of Posts and Telecommunications,Chongqing 400065;
    3.Chongqing Information Technology Designing Co.,Ltd.,Chongqing 401121,China)
  • Received:2020-07-10 Revised:2021-01-19 Accepted:2022-08-25 Online:2022-08-25 Published:2022-08-25

Abstract: In order to obtain a better feature subset in the text and eliminate interference and redundant features, a hybrid feature optimization algorithm  combining filtering and swarm intelligence algorithm is proposed. Firstly, the information gain value of each feature word is calculated, the better feature is selected as the preselected feature set, and then the sine cosine algorithm is used to optimize the preselected feature to obtain the selected feature set. In order to better balance the global search and local development capabilities in the sine-cosine algorithm, adaptive inertia weights are added. To more accurately evaluate feature subsets, a fitness function weighted by the number of features and accuracy is introduced, and a new location update mechanism is proposed. Experiment results on KNN and Bayesian classifier show that this feature selection model improves the classification accuracy, compared with other feature selection methods and the model before improvement.

Key words: feature selection, sine and cosine, inertia weight, classification accuracy