[1]Luebke D, Harris M, Krüger J, et al.GPGPU: General Purpose Computation on Graphics Hardware[C]∥Proc of ACM SIGGRAPH’04,2004:33.
[2]Fan Z, Qiu F, Kaufman A, et al. GPU Cluster for High Performance Computing[C]∥Proc of the 2004 ACM/IEEE Conference on Supercomputing,2004:47.
[3]http://www.top500.org.
[4]Ramkumar B,Strumpen V. Portable Checkpointing for Heterogeneous Architectures[C]∥Proc of the 27th International Symposium on FaultTolerant Computing,1997:5867.
[5]Kirk D. NVIDIA CUDA Software and GPU Parallel Computing Architecture[C]∥Proc of the 6th International Symposium on Memory Management,2007:103104.
[6]Kapasi U J, Rixner S, Dally W J, et al. Programmable Stream Processors[J]. IEEE Computer, 2003,36(8):5462.
[7]Advanced Micro Devices, Inc. AMD Brook+[EB/OL].[20091116].http://ati.amd.com/technology/streamcomputing/AMDBrookplus.pdf.
[8]Open Computing Language[EB/OL].[20091116].http://www.khronos.org/.
[9]CUDA Technical Training Volume I/II[R]. Prepared and Provided by NVIDIA, 2008.
[10]Elnozahy E N, Alvisi L, Wang Y, et al. A Survey of RollbackRecovery Protocols in MessagePassing Systems[J]. ACM Computing Surveys, 2002,34(3):375408.
[11]Plank J S, Li K, Puening M A. Diskless Checkpointing[J]. IEEE Transactions Parallel and Distributed Systems, 1998,9(10):972986.
[12]Compute Visual Profiler 4.0 for NVIDIA CUDA User Guide[R]. DU05162001_v04, 2011.
[13]Sheaffer J, Luebke D, Skadron K. A Hardware Redundancy and Recovery Mechanism for Reliable Scientific Computation on Graphics Processors[C]∥Proc of Graphics Hardware, 2007:5564.
[14]Dimitrov M, Mantor M, Zhou H. Understanding Software Approaches for GPGPU Reliability[C]∥Proc of the 2nd Workshop on General Purpose Processing on Graphics Processing Units,2009:94104.
[15]Krishna C M, YannHang L, Kang G S. Optimization Criteria for Checkpoint Placement[J]. Communication of the ACM, 1984, 27(10):10081012.
[16]Chandy K M,Ramamoorthy C V. Rollback and Recovery Strategies for Computer Programs[J]. IEEE Transactions on Computers, 1972,21(6):546556.
[17]Toueg S, Babaolu . On the Optimum Checkpoint Selection Problem[J]. SIAM Journal on Computing, 1984,13(3):630649.
[18]Upadhyaya S J,Saluja K K. An Experimental Study to Determine Task Size for Rollback Recovery Systems[J]. IEEE Transactions on Computers, 1988,37(7):872877. |