DocumentCode :
3150314
Title :
Multi-user real-time speech recognition with a GPU
Author :
Kim, Jungsuk ; Sung, Wonyong
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Seoul Nat. Univ., Seoul, South Korea
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
1617
Lastpage :
1620
Abstract :
We have developed a multi-user large vocabulary speech recognition system employing a fully composed one-level weighted finite state transducer (WFST) based network on a Graphics Processing Unit (GPU). This system improves the overall throughput and latency of speech recognition engine which processes multiple users´ utterances at the same time with efficient scheduling, parameter sharing, and communication overhead reduction techniques. We conduct both batch speech simulation and trace driven online simulation to access the performance of the developed system. Traces are generated based on a queueing model.
Keywords :
graphics processing units; queueing theory; speech recognition; GPU; batch speech simulation; communication overhead reduction technique; graphics processing unit; multiuser large vocabulary speech recognition system; multiuser real-time speech recognition; parameter sharing; queueing model; speech recognition engine; trace driven online simulation; weighted finite state transducer based network; Acoustics; Engines; Graphics processing unit; Hidden Markov models; Servers; Speech; Speech recognition; Distributed Speech Recognition; GPU; LVCSR; Speech recognition; WFST;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288204
Filename :
6288204
Link To Document :
بازگشت