• DocumentCode
    2061475
  • Title

    High performance implementation of tridiagonalization on the SR8000

  • Author

    Naono, Ken ; Yamamoto, Yiisaku ; Igai, Mitsuyoshi ; Hirayama, Hiroyuki

  • Author_Institution
    Central Res. Lab., Hitachi Ltd., Tokyo, Japan
  • Volume
    1
  • fYear
    2000
  • fDate
    14-17 May 2000
  • Firstpage
    206
  • Abstract
    The methods of high performance tridiagonalization on the HITACHI SR8000 are described and evaluated. To achieve high performance, we adopted the blocked tridiagonalization and the scattered square decomposition. In addition, to achieve more performance in one node, we took the ways of the rectangular computation in the diagonal blocks and the loop integration for reducing the number of read/write operations. On one node of the SR8000, we achieved about 4.0 Gflop/s in the 4000-dimension tridiagonalization of a real symmetric matrix. This is much better than the 2.9 Gflop/s of our matrix library´s on the HITACHI S-3800, which has the same peak performance with one node of the SR8000.
  • Keywords
    matrix decomposition; parallel algorithms; parallel machines; performance evaluation; 4.0 Gflop/s; HITACHI SR8000; blocked tridiagonalization; real symmetric matrix; scattered square decomposition; tridiagonalization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing in the Asia-Pacific Region, 2000. Proceedings. The Fourth International Conference/Exhibition on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7695-0589-2
  • Type

    conf

  • DOI
    10.1109/HPC.2000.846547
  • Filename
    846547