DocumentCode :
2634814
Title :
Folding spatial image filters on the CM-5
Author :
Dykes, Sandra G. ; Zhang, Xiaodong
Author_Institution :
Performance Comput. & Software Lab., Texas Univ., San Antonio, TX, USA
fYear :
1995
fDate :
25-28 Apr 1995
Firstpage :
451
Lastpage :
456
Abstract :
This paper presents an efficient data-parallel algorithm for general convolutions, and compares its performance on the CM-5 to FFT frequency filtering. Sequential FFT filters are faster than sequential convolutions for windows beyond a very small size, typically 6×6 pixels. Our folded convolution algorithm shifts the convolution/FFT performance crossover to much larger filter sizes. For 256×256 images on a 512 node CM-5, the folded convolution is faster than FFT-filtering up to 36×36 windows. Results are reported for a naively implemented convolution, our folded convolution with default and optimized memory layouts, and FFT filtering using FFTs from the CM-5 scientific library (CMSSL). The data yield two important results: 1. Parallel convolutions on the CM-5 are faster than FFT filtering for a substantial and important range of window sizes. This is in contrast to sequential systems, where convolutions are more efficient only for very small windows. 2. Considerable performance gains are realized by folding the convolution and optimizing layout
Keywords :
fast Fourier transforms; image processing; parallel algorithms; spatial filters; CM-5; convolutions; data-parallel algorithm; parallel convolutions; performance gains; sequential FFT filters; spatial image filters; Convolution; Filtering; Filters; Fourier transforms; Frequency; High performance computing; Image processing; Laboratories; Partitioning algorithms; Software performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing Symposium, 1995. Proceedings., 9th International
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-8186-7074-6
Type :
conf
DOI :
10.1109/IPPS.1995.395970
Filename :
395970
Link To Document :
بازگشت