DocumentCode :
2973947
Title :
Three-layer optimizations for fast GMM computations on GPU-like parallel processors
Author :
Gupta, Kshitij ; Owens, John D.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, CA, USA
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
146
Lastpage :
151
Abstract :
In this paper we focus on optimizing compute and memory-bandwidth-intensive GMM computations for low-end, small-form-factor devices running on GPU-like parallel processors. With special emphasis on tackling the memory bandwidth issue that is exacerbated by a lack of CPU-like caches providing temporal locality on GPU-like parallel processors, we propose modifications to three well-known GMM computation reduction techniques. We find considerable locality at the frame, CI-GMM, and mixture layers of GMM compute, and show how it can be extracted by following a chunk-based technique of processing multiple frames for every load of a GMM. On a 1,000- word, command-and-control, continuous-speech task, we are able to achieve compute and memory bandwidth savings of over 60% and 90% respectively, with some degradation in accuracy, when compared to existing GPU-based fast GMM computation techniques.
Keywords :
Gaussian processes; cache storage; coprocessors; memory architecture; optimisation; parallel processing; speech processing; CPU-like caches; GMM computation reduction techniques; GMM computation techniques; GPU-like parallel processors; chunk-based technique; command-and-control; continuous-speech task; fast GMM computations; memory bandwidth savings; memory-bandwidth-intensive GMM computations; small-form-factor devices; three-layer optimizations; Acceleration; Automatic speech recognition; Bandwidth; Computer architecture; Concurrent computing; Gaussian processes; Handheld computers; Hidden Markov models; Parallel processing; Pervasive computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5373410
Filename :
5373410
Link To Document :
بازگشت