• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (10): 1730-1737.

• High Performance Computing • Previous Articles     Next Articles

An online game user churn prediction method based on Spark platform

HU Yan-fang,XIONG Wen,GAO Wei   

  1. (School of Information,Yunnan Normal University,Kunming 650500,China)
  • Received:2022-01-25 Revised:2022-05-18 Accepted:2022-10-25 Online:2022-10-25 Published:2022-10-28

Abstract: With the widespread popularity of the mobile Internet, the domestic online game market has become increasingly saturated. The cost of acquiring new users for game companies continues to increase. How to prevent the loss of existing users has become the focus of marketing. This paper predicts user churn based on a real game log data. First, user features are extracted and computed from log data. Second, a set of important features is selected by weight. Finally, a binary classification model is constructed with features as input and churn as output. 6 common algorithms such as random forest, support vector machine, multi-layer perceptron, gradient boosting decision tree, and logistic regression are comprehensively compared. The experimental results show that the random forest algorithm performs the best, and its model prediction accuracy reaches 91%.

Key words: churn prediction;Spark, binary classification, machine learning, random forest