Title :
Linear-time matrix transpose algorithms using vector register file with diagonal registers
Author :
Hanounik, Bedros ; Hu, Xiaobo
Author_Institution :
Dept. of Comput. Sci. & Eng., Notre Dame Univ., IN, USA
Abstract :
Matrix transpose operation (MT) is used frequently in many multimedia and high performance applications. Therefore, using a faster MT operation results in a shorter execution time of these applications. In this paper we propose two new MT algorithms. The algorithms exploit diagonal register properties to achieve a linear-time execution of MT operation using vector processor that supports diagonal registers. We demonstrate the algorithms as well as proofs, examples, and various enhancements to the proposed algorithms A performance evaluation shows that the proposed algorithms are at least twice as fast as one of the leading MT algorithms such as an algorithm that is implemented using Motorola´s AltiVec architecture (n⩾16). We believe that our work opens new doors to improve the execution time of many two-dimensional operations such as DCT, DFT, and Shearsort
Keywords :
computational complexity; matrix algebra; parallel algorithms; MT operation; diagonal registers; linear-time execution; matrix transpose algorithms; vector register file; Acceleration; Application software; Computer architecture; Computer science; Discrete cosine transforms; Multimedia computing; Registers; Scientific computing; Sections; Vector processors;
Conference_Titel :
Parallel and Distributed Processing Symposium., Proceedings 15th International
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7695-0990-8
DOI :
10.1109/IPDPS.2001.924973