DocumentCode
1946288
Title
Invited paper: Using OpenCL to evaluate the efficiency of CPUS, GPUS and FPGAS for information filtering
Author
Chen, Doris ; Singh, Deshanand
Author_Institution
Altera Toronto Technol. Center, Toronto, ON, Canada
fYear
2012
fDate
29-31 Aug. 2012
Firstpage
5
Lastpage
12
Abstract
The FPGA can be a tremendously efficient computational fabric for many applications. In particular, the performance to power ratios of FPGA make them attractive solutions to solve the problem of data centers that are constrained largely by power and cooling costs. However, the complexity of the FPGA design flow requires the programmer to understand cycle-accurate details of how data is moved and transformed through the fabric. In this paper, we explore techniques that allow programmers to efficiently use FPGAs at a level of abstraction that is closer to traditional software-centric approaches by using the emerging parallel language, OpenCL. Although the field of high level synthesis has evolved greatly in the last few decades, several fundamental parts were missing from the complete software abstraction of the FPGA. These include standard and portable methods of describing HW/SW codesign, memory hierarchy, data movement and control of parallelism. We believe that OpenCL addresses all of these issues and allows for highly efficient description of FPGA designs with a higher level of abstraction. We demonstrate this premise by examining the performance of a document filtering algorithm, implemented in OpenCL and automatically compiled to a Stratix IV 530 FPGA. We show that our implementation achieves 5.5× and 5.25× better performance per watt ratios than GPU and CPU implementations, respectively.
Keywords
computer centres; data structures; document handling; field programmable gate arrays; graphics processing units; hardware-software codesign; information filtering; logic design; open systems; parallel programming; CPU; CPU implementations; FPGA design flow; GPU; GPU implementations; HW-SW codesign; OPENCL; OpenCL; Stratix IV 530 FPGA; abstraction level; complete software ab- straction; computational fabric; cooling costs; data centers; data movement; document filtering algorithm; high level synthesis; information filtering; memory hierarchy; parallel language; portable methods; power costs; power ratios; software-centric approaches; Clocks; Field programmable gate arrays; Graphics processing unit; Instruction sets; Kernel; Parallel processing; Pipelines;
fLanguage
English
Publisher
ieee
Conference_Titel
Field Programmable Logic and Applications (FPL), 2012 22nd International Conference on
Conference_Location
Oslo
Print_ISBN
978-1-4673-2257-7
Electronic_ISBN
978-1-4673-2255-3
Type
conf
DOI
10.1109/FPL.2012.6339171
Filename
6339171
Link To Document