Regional Information Center for Science and Technology - GPU-Accelerated HMM for Speech Recognition

DocumentCode :

3588954

Title :

GPU-Accelerated HMM for Speech Recognition

Author :

Yu, Leiming ; Ukidave, Yash ; Kaeli, David

Author_Institution :

Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA

fYear :

2014

Firstpage :

395

Lastpage :

402

Abstract :

Speech recognition is used in a wide range of applications and devices such as mobile phones, in-car entertainment systems and web-based services. Hidden Markov Models (HMMs) are one of the most popular algorithmic approaches applied in speech recognition. Training and testing an HMM is computationally intensive and time-consuming. Running multiple applications concurrently with speech recognition could overwhelm the compute resources and introduce unwanted delays in the speech processing, eventually dropping words in the process due to buffer overruns. Graphics processing units (GPUs) have become widely accepted as accelerators that offer massive amounts of parallelism. The host processor (the CPU) can offload compute-intensive portions of an application to the GPU, leaving the CPU to focus on serial tasks and scheduling operations. In this paper, we provide a parallelized Hidden Markov Model to accelerate isolated-word speech recognition. We experiment with different optimization schemes and make use of optimized GPU computing libraries to speed up the computation on GPUs. We also explore the performance benefits of using advanced GPU features for concurrent execution of multiple compute kernels. The algorithms are evaluated on multiple Nvidia GPUs using CUDA as a programming framework. Our GPU implementation achieves better performance than traditional serial and multithreaded implementations. When considering the end-to-end performance of the application, which includes both data transfer and computation, we achieve a 9x speedup for training with the use of a GPU over a multithreaded version optimized for a multi-core CPU.

Keywords :

graphics processing units; hidden Markov models; microprocessor chips; multi-threading; speech recognition; GPU-accelerated HMM; Web-based services; graphics processing unit; hidden Markov model; in-car entertainment system; mobile phones; multicore CPU; multiple compute kernel; multithreaded version; programming framework; speech recognition; Complexity theory; Graphics processing units; Hidden Markov models; Kernel; Nickel; Parallel processing; Speech recognition; GPUs; Hidden Markov Model; Speech Recognition;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Parallel Processing Workshops (ICPPW), 2014 43rd International Conference on

ISSN :

1530-2016

Type :

conf

DOI :

10.1109/ICPPW.2014.59

Filename :

7103477

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3588954