• DocumentCode
    2634814
  • Title

    Folding spatial image filters on the CM-5

  • Author

    Dykes, Sandra G. ; Zhang, Xiaodong

  • Author_Institution
    Performance Comput. & Software Lab., Texas Univ., San Antonio, TX, USA
  • fYear
    1995
  • fDate
    25-28 Apr 1995
  • Firstpage
    451
  • Lastpage
    456
  • Abstract
    This paper presents an efficient data-parallel algorithm for general convolutions, and compares its performance on the CM-5 to FFT frequency filtering. Sequential FFT filters are faster than sequential convolutions for windows beyond a very small size, typically 6×6 pixels. Our folded convolution algorithm shifts the convolution/FFT performance crossover to much larger filter sizes. For 256×256 images on a 512 node CM-5, the folded convolution is faster than FFT-filtering up to 36×36 windows. Results are reported for a naively implemented convolution, our folded convolution with default and optimized memory layouts, and FFT filtering using FFTs from the CM-5 scientific library (CMSSL). The data yield two important results: 1. Parallel convolutions on the CM-5 are faster than FFT filtering for a substantial and important range of window sizes. This is in contrast to sequential systems, where convolutions are more efficient only for very small windows. 2. Considerable performance gains are realized by folding the convolution and optimizing layout
  • Keywords
    fast Fourier transforms; image processing; parallel algorithms; spatial filters; CM-5; convolutions; data-parallel algorithm; parallel convolutions; performance gains; sequential FFT filters; spatial image filters; Convolution; Filtering; Filters; Fourier transforms; Frequency; High performance computing; Image processing; Laboratories; Partitioning algorithms; Software performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing Symposium, 1995. Proceedings., 9th International
  • Conference_Location
    Santa Barbara, CA
  • Print_ISBN
    0-8186-7074-6
  • Type

    conf

  • DOI
    10.1109/IPPS.1995.395970
  • Filename
    395970