• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2023, Vol. 45 ›› Issue (09): 1670-1678.

• Artificial Intelligence and Data Mining • Previous Articles     Next Articles

Blended MOOC video viewing pattern mining based on an improved self-adaptive DBSCAN

WANG Ruo-bin1,GENG Fang-dong1,ZHANG Yong-mei1,SONG Wei1,WANG Wei-feng1,XU Lin2   

  1. (1.School of Information Science and Technology,North China University of Technology,Beijing 100144,China;
    2.STEM,University of South Australia,Adelaide 5095,Australia)
  • Received:2022-02-07 Revised:2022-04-01 Accepted:2023-09-25 Online:2023-09-25 Published:2023-09-12

Abstract: The DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm based on density clustering can automatically perform classification tasks according to data features, and is often used for clustering analysis of complex data sets with noise. However, it has the defects of difficult parameter determination and high degree of human participation, which limits the application of automatic and high-precision mining. To overcome these defects, an adaptive DBSCAN algorithm based on the k-dist graph slope (KSSA-DBSCAN) is proposed. The algorithm can automatically select the appropriate k-dist graph inflection point as the optimal neighborhood based on the slope of the k-dist graph, and automatically determine the optimal density threshold during the clustering iteration process according to the change in the number of clusters, which overcomes the defects of difficult parameter determination and high degree of human participation. KSSA-DBSCAN is compared with DBSCAN and KANN-DBSCAN on six data sets, and the experimental results show that the accuracy of the algorithm is better than that of other algorithms on the four data sets, and the accuracy is increased by up to 25% compared with DBSCAN. When it is applied to the pattern mining of blended MOOC videos viewing behavior data, the results show that the algorithm can effectively and automatically mine the video viewing patterns, further verifying the effectiveness of the algorithm.

Key words: density-based clustering, self-adaptive, k-dist graph, blended MOOC, video viewing pattern