• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (7): 1226-1236.

• Software Engineering • Previous Articles     Next Articles

BotChecker:A Transformer-based GitHub bot detection model#br#

ZHANG Jin1,3,WU Xingjin1,ZHANG Yang2,XU Shunyu1   

  1. (1.College of Information Science and Engineering,Hunan Normal University,Changsha 410081;
    2.College of Computer Science and Technology,National University of Defense Technology,Changsha 410073;
    3.School of Computer and Communication Engineering,
    Changsha University of Science and Technology,Changsha 410114,China)
  • Received:2023-12-15 Revised:2024-03-25 Online:2025-07-25 Published:2025-08-25

Abstract: In open-source software,accurately identifying  software development assistant robots(Bots) and human contributors is crucial for understanding and evaluating contribution activities.Given the outstanding performance of deep learning models in NLP and software engineering-related fields,this paper proposes BotChecker,a Transformer-based automated bot detection model.By incorporating enhanced fully connected layers and a dedicated binary classifier structure into the Transformer,the model can effectively learn from comment text data of bot and human accounts to detect bots.Experiments validate the effectiveness of BotChecker in bot detection tasks,achieving Accuracy,Recall,and F1-score of 0.941,0.894,and 0.938,respectively.Furthermore,this paper analyzes the impact of model hyperparameters and zero-shot learning on BotChecker’s performance.The proposed model can provide technical support for bot account identification in open-source communities and serve as a methodological benchmark for future research.

Key words: open-source platform, Bot detection technique, empirical analysis, text processing