DocumentCode
2641712
Title
Implementation of Block Algorithm for LU Factorization
Author
Chen, Jianping ; Ji, Kun ; Shi, Zhenguo ; Liu, Weifu
Author_Institution
Sch. of Comput. Sci. & Technol., Nantong Univ., Nantong, China
Volume
2
fYear
2009
fDate
March 31 2009-April 2 2009
Firstpage
569
Lastpage
573
Abstract
Applying blocking techniques to computations on large dense matrices can make better use of the computer's memory hierarchy and increase computing efficiency. The blocked algorithm for LU factorization is studied in this paper. Efficient algorithms are designed for the different matrix operations involved in the blocked LU factorization algorithm. Optimization techniques including matrix transposition and loop unrolling are used in implementing the matrix computations. Experimental results show that the blocked LU factorization algorithm runs much faster than the standard LU factorization. A speedup of more than 50% is achieved.
Keywords
C++ language; mathematics computing; matrix decomposition; blocked LU factorization algorithm; blocking technology; computer memory hierarchy; computing efficiency; loop unrolling; matrix operation; matrix transposition; Algorithm design and analysis; Computer science; Equations; Linear systems; Matrices; Partitioning algorithms
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3507-4
Type
conf
DOI
10.1109/CSIE.2009.814
Filename
5171403