Title :
Implementation of FFT and CRBLASTER on the Maestro processor
Author :
Suh, Jinwoo ; Mighell, Kenneth J. ; Kang, Dong-In ; Crago, Stephen P.
Author_Institution :
Inf. Sci. Inst., Univ. of Southern California, Arlington, VA, USA
Abstract :
Currently, most microprocessors use multiple cores to increase performance while limiting power usage. Some processors use not just a few cores, but tens of cores or even 100 cores. One such many-core microprocessor is the Maestro processor, which is based on Tilera´s TILE64 processor. The Maestro chip is a 49-core, general-purpose, radiation-hardened processor designed for space applications. The Maestro processor, unlike the TILE64, has a floating-point coprocessor in each core for improved floating point performance. The Maestro processor runs at up to 350 MHz clock frequency. On the Maestro processor, we implemented an FFT and an image processing application called CRBLASTER and evaluated the performance. The FFT is a well-known and commonly used signal processing kernel. CRBLASTER is a parallel-processing image-analysis application that does cosmic-ray rejection on CCD (charge-coupled device) images using the embarrassingly-parallel L. A. COSMIC algorithm. Both applications were written in C. CRBLASTER uses the high-performance computing industry standard Message Passing Interface (MPI) library. The achieved performance of the FFT was up to 3,813 MFLOPS, and the speedup compared to single tile was 46.4 using 49 tiles. The speedup for CRBLASTER, which was memory-bound, was up to 12.5 using 36 tiles.
Keywords :
application program interfaces; coprocessors; fast Fourier transforms; image processing; message passing; multiprocessing systems; CCD image; CRBLASTER application; FFT; MPI library; Maestro chip; Maestro processor; Tilera TILE64 processor; charge-coupled device; cosmic-ray rejection; fast Fourier transform; floating point performance; floating-point coprocessor; image processing application; many-core microprocessor; message passing interface; parallel-processing image-analysis application; radiation-hardened processor; space application; Coprocessors; Educational institutions; Optimization; Process control; Registers; Tiles;
Conference_Titel :
Aerospace Conference, 2012 IEEE
Conference_Location :
Big Sky, MT
Print_ISBN :
978-1-4577-0556-4
DOI :
10.1109/AERO.2012.6187230