Title :
Fast acoustic computations using graphics processors
Author :
Dixon, Paul R. ; Oonishi, Tasuku ; Furui, Sadaoki
Author_Institution :
Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo
Abstract :
In this paper we present a fast method for computing acoustic likelihoods that makes use of a graphics processing unit (GPU). After enabling GPU acceleration, the main processor runtime dedicated to acoustic scoring drops from the largest consumer of runtime to just a few percent, even when using mixture models with a large number of Gaussian components. The results show a large reduction in decoding time with no change in accuracy. We also show that storing the acoustic model parameters in a 16-bit half-precision floating-point format halves the storage requirements with no reduction in accuracy.
Keywords :
acoustic signal processing; computer graphic equipment; speech coding; speech recognition; Gaussian mixture model components; acoustic model parameters; decoding; graphics processing unit; Bandwidth; Computer architecture; Computer graphics; Computer science; Covariance matrix; Hardware; Vocabulary; Yarn; GPGPU; LVCSR; WFST
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960585