DocumentCode :
2774821
Title :
Matrix Transpose on 2D Torus Array Processor
Author :
Zekri, Ahmed S. ; Sedukhin, Stanislav G.
Author_Institution :
The University of Aizu, Japan
fYear :
2006
fDate :
Sept. 2006
Firstpage :
45
Lastpage :
45
Abstract :
Previously, we represented the index space of the (n¿n)- matrix multiply-add problem C=C+A¿B as a 3D torus, where A, B, and C are rolled along the corresponding axes of the index space. All optimal 2D data allocations (resulted from projection) to solve the problem on the n¿n torus array processor in n multiply-add-roll steps were obtained. In this paper, we formulate the operations needed for aligning both the data before computing and the results after computing as matrix multiply-add problems. These alignment operations are combined with the optimal data allocations that solve the matrix multiply-add problem to propose new algorithms to transpose an n¿n matrix on the n¿n torus array processor in O(n) multiply-add-roll steps. Using the proposed algorithms, we showed different approaches to solve the transposed matrix multiply-add problem, C=C+A^T¿B^T , on the 2D torus array processor.
Keywords :
Cities and towns; Delay; Frequency domain analysis; Information technology; Routing; Space technology; Video coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology, 2006. CIT '06. The Sixth IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
0-7695-2687-X
Type :
conf
DOI :
10.1109/CIT.2006.117
Filename :
4019867
Link To Document :
بازگشت