• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2020, Vol. 42 ›› Issue (08): 1414-1422.

Previous Articles     Next Articles

Prediction of breast cancer based on C-AdaBoost model

LI Yong1,CHEN Si-xuan1,JIA Hai2,WANG Xia2   

  1. (1.College of Computer Science and Engineering,Northwest Normal University,Lanzhou 730070;

    2.Department of Pharmacy,the People’s Hospital of Gansu Province,Lanzhou 730000,China)

  • Received:2019-12-12 Revised:2020-02-27 Accepted:2020-08-25 Online:2020-08-25 Published:2020-08-29

Abstract: Machine learning and deep learning techniques can be used to solve many problems in me- dical classification prediction. Among them, some have higher prediction accuracy, but the others have limited accuracy. This paper proposes an ensemble learning algorithm based on C-AdaBoost model to predict breast cancer diseases. Stepwise regression is used to re-select existing features. The C-AdaBoost model is combined to make the prediction better. A large number of experiments show that 1) the optimal combination of features, that determines whether breast cancer recurs and whether breast cancer is benign, are found, and 2) the proposed ensemble learning algorithm based on C-AdaBoost improves the prediction accuracy by at most 19.5% in comparison to the machine learning classifiers such as SVM, Naive Bayes, RandomForest and traditional ensemble learning models, which can better help doctors make clinical decisions.


Key words: ensemble learning, stepwise regression, feature selection, disease prediction