DocumentCode
2050385
Title
Achieving Scalable Parallelization for the Hessenberg Factorization
Author
Castaldo, Anthony M. ; Whaley, R. Clint
Author_Institution
Dept. of Comput. Sci., Univ. of Texas at San Antonio, San Antonio, TX, USA
fYear
2011
fDate
26-30 Sept. 2011
Firstpage
65
Lastpage
73
Abstract
Much of dense linear algebra has been successfully blocked to concentrate the majority of its time in the Level 3 BLAS, which are not only efficient for serial computation, but also scale well for parallelism. For the Hessenberg factorization, which is a critical step in computing the eigenvalues and vectors, however, performance of the best known algorithm is still strongly limited by the memory speed, which does not tend to scale well at all. In this paper we present an adaptation of our Parallel Cache Assignment (PCA) technique to the Hessenberg factorization, and show that it achieves super linear speedup over the corresponding serial algorithm and a more than four-fold speedup over the best known algorithm for small and medium sized problems.
Keywords
cache storage; eigenvalues and eigenfunctions; linear algebra; parallel processing; Hessenberg factorization; PCA; achieving scalable parallelization; eigenvalues; linear algebra; memory speed; parallel cache assignment; Aggregates; Eigenvalues and eigenfunctions; Instruction sets; Linear algebra; Linux; Principal component analysis; ATLAS; Hessenberg; LAPACK; factorization; multi-core; multicore; parallel;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2011 IEEE International Conference on
Conference_Location
Austin, TX
Print_ISBN
978-1-4577-1355-2
Electronic_ISBN
978-0-7695-4516-5
Type
conf
DOI
10.1109/CLUSTER.2011.16
Filename
6061066
Link To Document