Title :
Pipelining Acoustic Model Training for Speech Recognition Using Storm
Author :
Sitaram, Dinkar ; Srinivasaraghavan, Haripriya ; Agarwal, K. ; Agrawal, Ankit ; Joshi, Niranjan ; Ray, Debtanu
Author_Institution :
Comput. Sci. Dept., PES Inst. of Technol., Bangalore, India
Abstract :
Speech recognition has been increasingly used on mobile devices, which has in turn increased the need for creation of new acoustic models for various languages, dialects, accents, speakers and environmental conditions. This involves training and adapting a huge number of acoustic models, some of them in real-time. Training acoustic models is thus essential for speech recognition because these models determine the accuracy and quality of the recognition process. This paper, discusses the use of Storm, a distributed real time computational system, to pipeline the creation of acoustic models by CMU Sphinx, an open-source software project for speech recognition and training. Software pipelining reduces the time required for training and optimizes system resource utilization, thus enabling huge amounts of data to be trained in considerably less amount of time than taken by the conventional sequential process. Pipelining is achieved by grouping the stages of the training process into a set of five stages, and running each stage on individual nodes in a Storm cluster. Thus acoustic models are created by training multiple streams of speech samples using the same SphinxTrain setup, also resulting in improvement of training time and throughput.
Keywords :
mobile computing; pipeline processing; public domain software; real-time systems; resource allocation; speech recognition; CMU Sphinx; SphinxTrain setup; Storm cluster; acoustic models; distributed real time computational system; environmental conditions; mobile devices; open-source software project; pipelining acoustic model training; recognition process; resource utilization; sequential process; software pipelining; speaker conditions; speech recognition; speech sample streams; training acoustic models; Acoustics; Adaptation models; Hidden Markov models; Pipeline processing; Speech recognition; Storms; Training; CMU Sphinx; Parallel Computing; Pipelining; Speech; SphinxTrain; Storm;
Conference_Titel :
Computational Intelligence, Modelling and Simulation (CIMSim), 2013 Fifth International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4799-2308-3
DOI :
10.1109/CIMSim.2013.42