• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (07): 1218-1228.

• High Performance Computing • Previous Articles     Next Articles

A method for constructing performance analysis model of high performance application based on random forest classifier

CHAI Xu-qing1,2,3,QIAO  Yi-hang1,2,3,FAN Li-lin1,2,3    

  1. (1.College of Computer and Information Engineering,Henan Normal University,Xinxiang 453007;
    2.High Performance Computing Center,Henan Normal University,Xinxiang 453007;
    3.Henan Engineering Laboratory of Intelligent Commerce and Internet of Things Technology,Xinxiang 453007,China)
  • Received:2023-11-03 Revised:2023-12-22 Accepted:2024-07-25 Online:2024-07-25 Published:2024-07-19

Abstract: Traditional performance analysis methods for high performance applications have shortcomings such as additional overhead during the analysis process and inaccurate analysis results, resulting in users spending more time and domain knowledge. To address these issues, this paper transforms the problem of program performance analysis into a multi-classification problem of unbalanced small sample datasets under high-dimensional features. By collecting 500 pieces of performance data that include seven types of metrics such as the number of process switches, memory utilization, and disk I/O load during program runtime, after data preprocessing such as PCA dimensionality reduction, a program performance problem analysis model is trained using a random forest classifier. Experimental validation shows that the model can identify five types of performance issues, including excessive memory utilization and heavy disk I/O load. To evaluate the effectiveness of the models guidance, this paper collects performance data generated by the HotSpot3D program and the LU-Decomposition program during runtime. Based on the models output guidance, the two validation programs are optimized at the runtime level and the compilation level. Experimental results indicate that the proposed method can effectively guide the optimization of program performance, with speedup ratios of 1.056 and 5.657 for the two programs, respectively.

Key words: Nmon, performance analysis, variational autoencoder, cluster, random forest