• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science ›› 2025, Vol. 47 ›› Issue (2): 238-246.

• Computer Network and Znformation Security • Previous Articles     Next Articles

rtTorTIM: A real-time Tor traffic identification method based on multi-modal feature fusion and Stacking ensemble learning

WANG Yufei,LIU Qiang,ZHANG Weizhen,WU Xiaojie,LI Jiawen,WANG Yuheng   

  1. (College of Computer Science and Technology,National University of Defense Technology,Changsha 410073,China)
  • Received:2024-07-04 Revised:2024-08-03 Online:2025-02-25 Published:2025-02-21

Abstract: Tor network, as a representative of anonymous networks, offers strong privacy protection while also providing a breeding ground for cybercriminal activities. Therefore, conducting research on real-time and high-precision identification of Tor network traffic is of great practical significance. To address issues of weak generalization and poor real-time performance in existed research, a Tor network traffic identification method, called rtTorTIM, based on multi-modal feature fusion and Stacking ensemble learning technology is proposed. Specifically, the method firstly extracts features from three modalities: host-level, stream-level, and packet-level of Tor network traffic, and then constructs a feature dataset. Random forest, linear regression, and K-nearest neighbor methods are subsequently selected as base learners, along with a linear neural network for decision fusion, to construct a two-layer Stacking traffic classifier. Comparative experimental results based on ISCX Tor 2016 public dataset show that accuracy, precision, and recall  of the rtTorTIM method   are all 99% in Tor traffic identification, while also demonstrating better performance in terms of real-time classification.

Key words: Tor anonymous network, multi-modal feature extraction, real-time traffic identification, Stacking ensemble learning, machine learning