J4 ›› 2016, Vol. 38 ›› Issue (02): 210-216.
• 论文 • Previous Articles Next Articles
LU Qingnan,LIU Zhong
Received:
Revised:
Online:
Published:
Abstract:
We propose a vectorization method of QR decomposition with Givens rotation on Matrix processors. According to the systematic characteristics of Matrix architecture, the computation tasks are evenly distributed to all vector processing elements by optimizing the memory access to vector data and calculation. We also design a double DMA buffering scheme to smooth the data transfers, which can fully overlap the kernel computation time and the DMA data transfer time so that the kernel computation is always at its peak speed and the best computation efficiency is achieved. Experimental results show that the proposal can achieve higher computation efficiency and performance speedup.
Key words: QR decomposition;vector processor;Givens rotation;software pipeline
LU Qingnan,LIU Zhong. A vectorization method of QR decomposition based on Matrix [J]. J4, 2016, 38(02): 210-216.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2016/V38/I02/210