[1]Alvin R, Chatterjee L S, Patnala P K, et al. Recursive Array Layouts and Fast Matrix Multiplication[J]. IEEE Trans on Parallel and Distributed Systems, 2002, 13(11):11051123.
[2]Pike G,Hilnger P N. Better Tiling and Array Contraction for Compiling ScienticPrograms[C]∥Proc of the IEEE/ACM Conf on Supercomputing, 2002:112.
[3]Wonnacott D. Using Time Skewing to Eliminate Idle Time due to Memory Bandwidth and Network Limitations[C]∥Proc of Int’l Parallel and Distributed Processing Symp, 2000:171180.
[4]Petrini F, Fossum G, Fernndez J, et al. Multicore Surprise Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine[C]∥Proc of IPDPS’07, 2007.
[5]刘杰,胡庆丰,韩国兴. 分布式存储环境下非平衡刚性方程组的数值并行计算[J].计算物理,2002, 19(1):8693.
[6]莫则尧,刘兴平,彭力田,等.优化和并行一个数值油藏模拟软件中的解法器[J].石油学报,2000,21(2):5661.
[7]Liu Jie, Chi Lihau, Chen Jin. Parallel Numerical Simulation for the Multigroup Particle Transport Equation[C]∥Proc of 2008 Int’l Symp on Distributed Computing and Applications for Business Engineering and Science, 2008:154160.