DocumentCode
2634814
Title
Folding spatial image filters on the CM-5
Author
Dykes, Sandra G. ; Zhang, Xiaodong
Author_Institution
Performance Comput. & Software Lab., Texas Univ., San Antonio, TX, USA
fYear
1995
fDate
25-28 Apr 1995
Firstpage
451
Lastpage
456
Abstract
This paper presents an efficient data-parallel algorithm for general convolutions, and compares its performance on the CM-5 to FFT frequency filtering. Sequential FFT filters are faster than sequential convolutions for windows beyond a very small size, typically 6×6 pixels. Our folded convolution algorithm shifts the convolution/FFT performance crossover to much larger filter sizes. For 256×256 images on a 512 node CM-5, the folded convolution is faster than FFT-filtering up to 36×36 windows. Results are reported for a naively implemented convolution, our folded convolution with default and optimized memory layouts, and FFT filtering using FFTs from the CM-5 scientific library (CMSSL). The data yield two important results: 1. Parallel convolutions on the CM-5 are faster than FFT filtering for a substantial and important range of window sizes. This is in contrast to sequential systems, where convolutions are more efficient only for very small windows. 2. Considerable performance gains are realized by folding the convolution and optimizing layout
Keywords
fast Fourier transforms; image processing; parallel algorithms; spatial filters; CM-5; convolutions; data-parallel algorithm; parallel convolutions; performance gains; sequential FFT filters; spatial image filters; Convolution; Filtering; Filters; Fourier transforms; Frequency; High performance computing; Image processing; Laboratories; Partitioning algorithms; Software performance;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing Symposium, 1995. Proceedings., 9th International
Conference_Location
Santa Barbara, CA
Print_ISBN
0-8186-7074-6
Type
conf
DOI
10.1109/IPPS.1995.395970
Filename
395970
Link To Document