• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2013, Vol. 35 ›› Issue (4): 8-13.

• 论文 • 上一篇    下一篇

SSTD:基于栈帧分析的可扩展并行程序调试工具

蒋杰1,方力2,卢凯1,刘杰1,武林平3   

  1. (1.国防科学技术大学计算机学院,湖南 长沙 410073;2.国防科学技术大学网络中心,湖南 长沙 410073;
    3.北京应用物理与计算数学研究所高性能计算中心,北京 100088)
  • 收稿日期:2012-04-11 修回日期:2012-08-15 出版日期:2013-04-25 发布日期:2013-04-25
  • 基金资助:

    国家863计划资助项目(2012AA01A301,2012AA010903);国家自然科学基金资助项目(60970033)

SSTD:Scalable parallel debug tool based on stack trace analysis    

JIANG Jie1,FANG Li2,LU Kai1,LIU Jie1,WU Linping3   

  1. (1.School of Computer Science,National University of Defense Technology,Changsha 410073;
    2.Network Center,National University of Defense Technology,Changsha 410073;
    3.High Performance Computing Center,Institute of Applied Physics and Computational Mathematics,Beijing 100088,China)
  • Received:2012-04-11 Revised:2012-08-15 Online:2013-04-25 Published:2013-04-25

摘要:

设计了一个基于栈帧分析的可扩展并行调试工具SSTD,在并行程序异常退出时获取进程栈帧信息,辅助用户快速定位程序故障,提高调试效率。详细讨论了SSTD系统架构以及可扩展通信结构、进程等价类划分等关键技术。

关键词: 并行调试, 栈帧分析, 可扩展

Abstract:

In this paper, a scalable parallel debug tool (SSTD) was proposed. By capturing and dumping process stack traces when the target parallel program exits abnormally, the SSTD can help users fast locate program faults and improve parallel debug efficiency. The system architecture and some key technologies of the SSTD, including scalable communication infrastructure and process equivalence classification, were discussed in detail.

Key words: parallel debug;stack trace analysis;scalable