Title :
Development of highly accurate real-time large scale speech recognition system
Author :
Kim, I. ; Park, C. ; Lee, K. ; Kim, N. ; Lee, J. ; Kim, J. ; Lane, I.
Author_Institution :
DMC R&D Center, Samsung Electron., Suwon, South Korea
Abstract :
This paper describes the development of the framework and the algorithm for large scale automatic speech recognition systems. Technical advances include the acceleration of decoding speed by leveraging the computational power of many-core graphic processing units (GPU), in order to solve the issue of training data sparseness, improvement in the accuracy by Subspace Gaussian Mixture Models (SGMM), and employing novel methods of language models such as the Instant Language Model Adaptation (ILMA) method. We present the effectiveness of each technique by evaluating it with actual usage data collected from television sets. It is shown that the proposed engine can recognize speech at real time with high accuracy.
Keywords :
Gaussian processes; graphics processing units; mixture models; real-time systems; speech coding; speech recognition; GPU; ILMA method; SGMM; computational power; decoding speed; highly accurate real-time large scale speech recognition system; instant language model adaptation; many-core graphic processing units; subspace Gaussian mixture model; television sets; training data sparseness; Accuracy; Acoustics; Adaptation models; Data models; Decoding; Graphics processing units; Speech recognition;
Conference_Titel :
Consumer Electronics (ICCE), 2015 IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4799-7542-6
DOI :
10.1109/ICCE.2015.7066496