DocumentCode
2061475
Title
High performance implementation of tridiagonalization on the SR8000
Author
Naono, Ken ; Yamamoto, Yiisaku ; Igai, Mitsuyoshi ; Hirayama, Hiroyuki
Author_Institution
Central Res. Lab., Hitachi Ltd., Tokyo, Japan
Volume
1
fYear
2000
fDate
14-17 May 2000
Firstpage
206
Abstract
The methods of high performance tridiagonalization on the HITACHI SR8000 are described and evaluated. To achieve high performance, we adopted the blocked tridiagonalization and the scattered square decomposition. In addition, to achieve more performance in one node, we took the ways of the rectangular computation in the diagonal blocks and the loop integration for reducing the number of read/write operations. On one node of the SR8000, we achieved about 4.0 Gflop/s in the 4000-dimension tridiagonalization of a real symmetric matrix. This is much better than the 2.9 Gflop/s of our matrix library´s on the HITACHI S-3800, which has the same peak performance with one node of the SR8000.
Keywords
matrix decomposition; parallel algorithms; parallel machines; performance evaluation; 4.0 Gflop/s; HITACHI SR8000; blocked tridiagonalization; real symmetric matrix; scattered square decomposition; tridiagonalization;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing in the Asia-Pacific Region, 2000. Proceedings. The Fourth International Conference/Exhibition on
Conference_Location
Beijing, China
Print_ISBN
0-7695-0589-2
Type
conf
DOI
10.1109/HPC.2000.846547
Filename
846547
Link To Document