DocumentCode
2308528
Title
Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices
Author
Huggins-Daines, David ; Kumar, Mohit ; Chan, Arthur ; Black, Alan W. ; Ravishankar, Mosur ; Rudnicky, Alex I.
Author_Institution
Inst. of Language Technol., Carnegie Mellon Univ., Pittsburgh, PA
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
The availability of real-time continuous speech recognition on mobile and embedded devices has opened up a wide range of research opportunities in human-computer interactive applications. Unfortunately, most of the work in this area to date has been confined to proprietary software, or has focused on limited domains with constrained grammars. In this paper, we present a preliminary case study on the porting and optimization of CMU Sphinx-11, a popular open source large vocabulary continuous speech recognition (LVCSR) system, to hand-held devices. The resulting system operates in an average 0.87 times real-time on a 206 MHz device, 8.03 times faster than the baseline system. To our knowledge, this is the first hand-held LVCSR system available under an open-source license
Keywords
human computer interaction; mobile handsets; speech recognition; 206 MHz; PocketSphinx; embedded devices; hand-held devices; human-computer interactive applications; large vocabulary continuous speech recognition; mobile devices; real-time continuous speech recognition system; Application software; Graphical user interfaces; Hardware; Licenses; Natural languages; Open source software; Operating systems; Real time systems; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1659988
Filename
1659988
Link To Document