• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2011, Vol. 33 ›› Issue (11): 132-139.

• Article •

Software FaultTolerance Techniques for Transient Faults

XU Jianjun,TAN Qingping,XIONG Yinqiao,TAN Lanfang,LI Jianli   

  1. (School of Computer Science, National University of Defense Technology, Changsha 410073, China)
  • Received: 2009-07-10 Revised: 2009-12-04 Online: 2011-11-25 Published: 2011-11-25

Abstract:

Transient faults, which are caused by radiation from cosmic rays, have long been one of the top challenges for computing in space applications. As integrated circuit technology continues to advance, the performance of modern processors has improved significantly, but their dependability is increasingly affected by transient faults. Current techniques for transient fault tolerance fall mainly into two classes: hardware-implemented and software-implemented. Compared with the former, the latter are attractive because of their advantages in cost and flexibility. This paper first outlines the basic principle of transient fault tolerance and the characteristics of software-implemented techniques. It then introduces and analyzes representative software-implemented fault tolerance techniques at different levels. Finally, it summarizes the properties and shortcomings of current studies and offers suggestions for future research on software-implemented fault tolerance.

Key words: transient fault; soft error; software fault tolerance; redundancy