• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

QTRSM on ARMv8 64-bit multi-core processor

DU Qi,JIANG Hao,LI Kuan,PENG Lin,YANG Can-qun   

  1. (College of Computer,National University of Defense Technology,Changsha 410073,China)
  • Received:2016-09-05 Revised:2016-11-07 Online:2017-03-25 Published:2017-03-25

Abstract:

We implement a quad-precision triangular matrix solution with multiple right-hand sides (QTRSM) based on OpenBLAS on the ARMv8 64-bit multi-core processor. We also propose two methods to implement QTRSM. One is based on GCC complier which accepts the long double data type as quad-precision floating-point numbers. The other uses the double-double data type and its corresponding quad-precision addition, subtraction, multiplication and division algorithms to implement QTRSM. We compare the two methods under different matrix sizes. Experimental results show that the two methods have the same accuracy. However, on average the method using double-double format runs 1.6 times faster than the one using long double format. As the number of threads increases, the speedup of the two QTRSM implementation methods are both close to 2.0, which has good scalability.

Key words: ARMv8 64-bit multi-core processor, OpenBLAS, quad-precision, double-double data type, QTRSM