• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2013, Vol. 35 ›› Issue (4): 8-13.

• 论文 • Previous Articles     Next Articles

SSTD:Scalable parallel debug tool based on stack trace analysis    

JIANG Jie1,FANG Li2,LU Kai1,LIU Jie1,WU Linping3   

  1. (1.School of Computer Science,National University of Defense Technology,Changsha 410073;
    2.Network Center,National University of Defense Technology,Changsha 410073;
    3.High Performance Computing Center,Institute of Applied Physics and Computational Mathematics,Beijing 100088,China)
  • Received:2012-04-11 Revised:2012-08-15 Online:2013-04-25 Published:2013-04-25

Abstract:

In this paper, a scalable parallel debug tool (SSTD) was proposed. By capturing and dumping process stack traces when the target parallel program exits abnormally, the SSTD can help users fast locate program faults and improve parallel debug efficiency. The system architecture and some key technologies of the SSTD, including scalable communication infrastructure and process equivalence classification, were discussed in detail.

Key words: parallel debug;stack trace analysis;scalable