DocumentCode :
3080686
Title :
Evaluating the potential of graphics processors for high performance embedded computing
Author :
Mu, Shuai ; Wang, Chenxi ; Liu, Ming ; Li, Dongdong ; Zhu, Maohua ; Chen, Xiaoliang ; Xie, Xiang ; Deng, Yangdong
Author_Institution :
Tsinghua Univ., Beijing, China
fYear :
2011
fDate :
14-18 March 2011
Firstpage :
1
Lastpage :
6
Abstract :
Today´s high performance embedded computing applications are posing significant challenges for processing throughout. Traditionally, such applications have been realized on application specific integrated circuits (ASICs) and/or digital signal processors (DSP). However, ASICs´ advantage in performance and power often could not justify the fast increasing fabrication cost, while current DSP offers a limited processing throughput that is usually lower than 100GFLOPS. On the other hand, current multi-core processors, especially graphics processing units (GPUs), deliver very high computing throughput, and at the same time maintain high flexibility and programmability. It is thus appealing to study the potential of GPUs for high performance embedded computing. In this work, we perform a comprehensive performance evaluation on GPUs with the high performance embedded computing (HPEC) benchmark suite, which consist a broad range of signal processing benchmarks with an emphasis on radar processing applications. We develop efficient GPU implementations that could outperform previous results for all the benchmarks. In addition, a systematic instruction level analysis for the GPU implementations is conducted with a GPU micro-architecture simulator. The results provide key insights on optimizing GPU hardware and software. Meanwhile, we also compared the performance and power efficiency between GPU and DSP with the HPEC benchmarks. The comparison reveals that the major hurdle for GPU´s applications in embedded computing is its relatively low power efficiency.
Keywords :
computer graphic equipment; coprocessors; embedded systems; multiprocessing systems; radar signal processing; application specific integrated circuits; digital signal processors; graphics processors; high performance embedded computing; multicore processors; radar processing application; Benchmark testing; Digital signal processing; Graphics processing unit; Instruction sets; Kernel; Parallel processing; Throughput; DSP; Fermi; GFLOPS; GPU; HPEC benchmark; Multi-core; parallel computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2011
Conference_Location :
Grenoble
ISSN :
1530-1591
Print_ISBN :
978-1-61284-208-0
Type :
conf
DOI :
10.1109/DATE.2011.5763120
Filename :
5763120
Link To Document :
بازگشت