Computer Engineering & Science
Previous Articles Next Articles
DU Qi,JIANG Hao,LI Kuan,PENG Lin,YANG Can-qun
Received:
Revised:
Online:
Published:
Abstract:
We implement a quad-precision triangular matrix solution with multiple right-hand sides (QTRSM) based on OpenBLAS on the ARMv8 64-bit multi-core processor. We also propose two methods to implement QTRSM. One is based on GCC complier which accepts the long double data type as quad-precision floating-point numbers. The other uses the double-double data type and its corresponding quad-precision addition, subtraction, multiplication and division algorithms to implement QTRSM. We compare the two methods under different matrix sizes. Experimental results show that the two methods have the same accuracy. However, on average the method using double-double format runs 1.6 times faster than the one using long double format. As the number of threads increases, the speedup of the two QTRSM implementation methods are both close to 2.0, which has good scalability.
Key words: ARMv8 64-bit multi-core processor, OpenBLAS, quad-precision, double-double data type, QTRSM
DU Qi,JIANG Hao,LI Kuan,PENG Lin,YANG Can-qun. QTRSM on ARMv8 64-bit multi-core processor[J]. Computer Engineering & Science.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2017/V39/I03/451