DocumentCode :
3588954
Title :
GPU-Accelerated HMM for Speech Recognition
Author :
Leiming Yu ; Ukidave, Yash ; Kaeli, David
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
fYear :
2014
Firstpage :
395
Lastpage :
402
Abstract :
Speech recognition is used in a wide range of applications and devices such as mobile phones, in-car entertainment systems and web-based services. Hidden Markov Models (HMMs) is one of the most popular algorithmic approaches applied in speech recognition. Training and testing a HMM is computationally intensive and time-consuming. Running multiple applications concurrently with speech recognition could overwhelm the compute resources, and introduce unwanted delays in the speech processing, eventually dropping words in the process due to buffer overruns. Graphics processing units (GPUs) have become widely accepted as accelerators which offer massive amounts of parallelism. The host processor (the CPU) can offload compute-intensive portions of an application to the GPU, leaving the CPU to focus on serial tasks and scheduling operations. In this paper, we provide a parallelized Hidden Markov Model to accelerate isolated words speech recognition. We experiment with different optimization schemes and make use of optimized GPU computing libraries to speedup the computation on GPUs. We also explore the performance benefits of using advanced GPU features for concurrent execution of multiple compute kernels. The algorithms are evaluated on multiple Nvidia GPUs using CUDA as a programming framework. Our GPU implementation achieves better performance than traditional serial and multithreaded implementations. When considering the end-to-end performance of the application, which includes both data transfer and computation, we achieve a 9x speedup for training with the use of a GPU over a multi-threaded version optimized for a multi-core CPU.
Keywords :
graphics processing units; hidden Markov models; microprocessor chips; multi-threading; speech recognition; GPU-accelerated HMM; Web-based services; graphics processing unit; hidden Markov model; in-car entertainment system; mobile phones; multicore CPU; multiple compute kernel; multithreaded version; programming framework; speech recognition; Complexity theory; Graphics processing units; Hidden Markov models; Kernel; Nickel; Parallel processing; Speech recognition; GPUs; Hidden Markov Model; Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing Workshops (ICCPW), 2014 43rd International Conference on
ISSN :
1530-2016
Type :
conf
DOI :
10.1109/ICPPW.2014.59
Filename :
7103477
Link To Document :
بازگشت