• DocumentCode
    506181
  • Title

    Macrotasking the singular value decomposition of block circulant matrices on the Cray-2

  • Author

    Baker, John R.

  • Author_Institution
    Department of Electrical Engineering and Computer Sciences, University of California, Berkeley and Research Medicine and Radiation Biophysics Division, Lawrence Berkeley Laboratory
  • fYear
    1989
  • fDate
    12-17 Nov. 1989
  • Firstpage
    243
  • Lastpage
    247
  • Abstract
    A parallel algorithm to compute the singular value decomposition (SVD) of block circulant matrices on the Cray-2 is described. For a block circulant form described by M blocks with m × n elements in each block, the computation time using an SVD algorithm for general matrices has a lower bound of order M³ min(m, n) mn. Using a combination of fast Fourier transform (FFT) and SVD steps, the computation time for block circulant singular value decomposition (BCSVD) has a lower bound of order M min(m, n) mn, a relative savings of about M². Memory usage bounds are reduced from order M² mn to order M mn, a relative savings of about M. For M = m = n = 64, this decreases the computation time from approximately 12 hours to 30 seconds and reduces memory usage from 768 megabytes to 12 megabytes. The BCSVD algorithm partitions well into n macrotasks with a granularity of order mM log M for the FFT portion of the algorithm. The SVD portion of the algorithm partitions into M macrotasks with a granularity of order min(m, n) mn. Again, for the case where M = m = n = 64, the FFT granularity is 29 ms and the SVD granularity is 428 ms. A speedup of 3.06 was achieved by using a prescheduled partitioning of tasks; the process creation overhead was 2.63 ms. Using a more elaborate self-scheduling method with four synchronizing server processes, a speedup of 3.25 was observed with four processors available; the server synchronization overhead was 0.32 ms. Relative memory overhead in both cases was about 4% for data space and 40% for code space.
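
    The FFT-then-SVD structure summarized in the abstract can be illustrated with a minimal NumPy sketch. This is not the paper's Cray-2 macrotasking code: the function names, the block layout (block (j, k) of the full matrix equals block (j - k) mod M), and the use of `numpy.fft.fft` and `numpy.linalg.svd` are all assumptions made for illustration. It computes only the singular values and checks them against a dense SVD of the assembled matrix.

```python
import numpy as np


def block_circulant(blocks):
    """Assemble the dense (M*m) x (M*n) block circulant matrix.

    Assumed layout (not taken from the paper): block (j, k) of the
    result is blocks[(j - k) % M].
    """
    M, m, n = blocks.shape
    C = np.zeros((M * m, M * n), dtype=blocks.dtype)
    for j in range(M):
        for k in range(M):
            C[j * m:(j + 1) * m, k * n:(k + 1) * n] = blocks[(j - k) % M]
    return C


def bcsvd_singular_values(blocks):
    """Singular values of the block circulant matrix without forming it.

    blocks has shape (M, m, n): the M defining blocks of size m x n.
    """
    # FFT step: one length-M transform along the block index for each
    # of the m*n element positions. These transforms are independent,
    # so they can be grouped into parallel tasks.
    spectral = np.fft.fft(blocks, axis=0)
    # SVD step: M independent m x n SVDs, one per frequency index.
    s = np.linalg.svd(spectral, compute_uv=False)  # shape (M, min(m, n))
    # Collect all singular values in descending order.
    return np.sort(s.ravel())[::-1]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    blocks = rng.standard_normal((4, 3, 2))  # M=4, m=3, n=2 (toy sizes)
    dense = np.linalg.svd(block_circulant(blocks), compute_uv=False)
    fast = bcsvd_singular_values(blocks)
    print(np.allclose(np.sort(dense)[::-1], fast))
```

    The sketch works only with the M m × n blocks and their transforms, never the full Mm × Mn matrix, which is where the memory reduction by a factor of about M comes from. The M small SVDs are mutually independent, matching the abstract's description of M macrotasks for the SVD portion.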
  • Keywords
    Biomedical imaging; Biophysics; Concurrent computing; Equations; Fast Fourier transforms; Laboratories; Matrix decomposition; Partitioning algorithms; Permission; Singular value decomposition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Supercomputing, 1989. Supercomputing '89. Proceedings of the 1989 ACM/IEEE Conference on
  • Conference_Location
    Reno, NV, United States
  • Print_ISBN
    0-89791-341-8
  • Type

    conf

  • DOI
    10.1145/76263.76289
  • Filename
    5349016