DocumentCode :
2049657
Title :
Hardware speech recognition for user interfaces in low cost, low power devices
Author :
Nedevschi, Sergiu ; Patra, Rabin K. ; Brewer, Eric A.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., California Univ., Berkeley, CA, USA
fYear :
2005
fDate :
13-17 June 2005
Firstpage :
684
Lastpage :
689
Abstract :
We propose a system architecture for real-time hardware speech recognition on low-cost, power-constrained devices. The system is intended to support real-time speech-based user interfaces as part of an effort to bring information and communication technologies (ICTs) to underdeveloped regions of the world. Our system architecture exploits a shared infrastructure model. The computationally intensive task of speech model training and retraining is performed offline by shared servers, while the actual recognition of speech is conducted on low-cost hand-held devices using custom hardware. The recognizer is extremely flexible and can support multiple languages or dialects with speaker-independent recognition. Dynamic loading of speech models is used for changing language grammar and retraining, while reprogramming is used to support evolution of recognition algorithms. The focus on small sets of words (at one time) reduces the complexity, cost and power consumption. We design the speech decoder, the central component of the recognizer, and we validate it via a prototype FPGA implementation. We then use ASIC synthesis to estimate power and size for the design. Our evaluations demonstrate an order of magnitude improvement in power compared with optimized recognition software running on a low-power embedded general-purpose processor of the same technology and of similar capabilities. The synthesis also estimates the area of the design to be about 2.5mm, showing potential for lower cost. In designing and testing our recognizer we use datasets in both English and Tamil languages.
Keywords :
application specific integrated circuits; field programmable gate arrays; natural languages; speech recognition; user interfaces; vocoders; ASIC synthesis; FPGA implementation; hand-held device; information communication technology; language grammar; power consumption; power-constrained device; speech decoder; speech model training; speech recognition; user interface; Communications technology; Computer architecture; Costs; Handheld computers; Hardware; Natural languages; Power system modeling; Real time systems; Speech recognition; User interfaces;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design Automation Conference, 2005. Proceedings. 42nd
Print_ISBN :
1-59593-058-2
Type :
conf
DOI :
10.1109/DAC.2005.193899
Filename :
1510419
Link To Document :
بازگشت