مرکز منطقه ای اطلاع رساني علوم و فناوري - Pipelining Acoustic Model Training for Speech Recognition Using Storm

DocumentCode :

643340

Title :

Pipelining Acoustic Model Training for Speech Recognition Using Storm

Author :

Sitaram, Dinkar ; Srinivasaraghavan, Haripriya ; Agarwal, K. ; Agrawal, Ankit ; Joshi, Niranjan ; Ray, Debtanu

Author_Institution :

Comput. Sci. Dept., PES Inst. of Technol., Bangalore, India

fYear :

2013

fDate :

24-25 Sept. 2013

Firstpage :

219

Lastpage :

224

Abstract :

Speech recognition has been increasingly used on mobile devices, which has in turn increased the need for creation of new acoustic models for various languages, dialects, accents, speakers and environmental conditions. This involves training and adapting a huge number of acoustic models, some of them in real-time. Training acoustic models is thus essential for speech recognition because these models determine the accuracy and quality of the recognition process. This paper, discusses the use of Storm, a distributed real time computational system, to pipeline the creation of acoustic models by CMU Sphinx, an open-source software project for speech recognition and training. Software pipelining reduces the time required for training and optimizes system resource utilization, thus enabling huge amounts of data to be trained in considerably less amount of time than taken by the conventional sequential process. Pipelining is achieved by grouping the stages of the training process into a set of five stages, and running each stage on individual nodes in a Storm cluster. Thus acoustic models are created by training multiple streams of speech samples using the same SphinxTrain setup, also resulting in improvement of training time and throughput.

Keywords :

mobile computing; pipeline processing; public domain software; real-time systems; resource allocation; speech recognition; CMU Sphinx; SphinxTrain setup; Storm cluster; acoustic models; distributed real time computational system; environmental conditions; mobile devices; open-source software project; pipelining acoustic model training; recognition process; resource utilization; sequential process; software pipelining; speaker conditions; speech recognition; speech sample streams; training acoustic models; Acoustics; Adaptation models; Hidden Markov models; Pipeline processing; Speech recognition; Storms; Training; CMU Sphinx; Parallel Computing; Pipelining; Speech; SphinxTrain; Storm;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Intelligence, Modelling and Simulation (CIMSim), 2013 Fifth International Conference on

Conference_Location :

Seoul

Print_ISBN :

978-1-4799-2308-3

Type :

conf

DOI :

10.1109/CIMSim.2013.42

Filename :

6663188

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=643340