DocumentCode
5806
Title
Low-Cost Speaker and Language Recognition Systems Running on a Raspberry Pi
Author
Haro, Luis Fernando D. ; Cordoba, Ricardo ; Rojo Rivero, Jose Ignacio ; Diez de la Fuente, Jorge ; Avendano Peces, Diego ; Bermudo Mera, Jose Maria
Author_Institution
Dipt. de Ing. Electron., Univ. Politec. de Madrid, Madrid, Spain
Volume
12
Issue
4
fYear
2014
fDate
Jun-14
Firstpage
755
Lastpage
763
Abstract
This paper describes two state-of-the-art and portable voice-based authentication and language recognition systems. While the authentication system allows secure access to a media center at home, the language recognition system can be used as a previous step to automatically transcribe and translate the recognized text from its original language into another one. The most important advantage of the developed systems is that they can run on a low cost embedded device, such as a Raspberry Pi (RPi), and using only open-source projects, which makes it feasible to replicate or include in other systems, but also allows its implementation as part of educational projects in electronics. The developed systems have been tested on real data with very good results. Regarding the authentication system, the validation process is done in 3.3 seconds in average with an EER of 19% on test files with 20 seconds, and tested with up to 87 different speakers. On the other hand, the language recognition system is able to recognize up to six languages. For this system, important efforts were done in order to reduce the processing time and memory requirements while keeping high the recognition rate. The final system uses 64 Gaussians and 200 i-vectors, obtaining a Cavg error rate of 8.6% for the six languages.
Keywords
natural language processing; speaker recognition; Raspberry Pi; authentication system; language recognition systems; low cost speaker; open source projects; voice based authentication; Androids; Economic indicators; Humanoid robots; Media; Multimedia communication; Robustness; Software; Language recognition; Speaker recognition; embedded devices; i-vectors; open-source tools;
fLanguage
English
Journal_Title
Latin America Transactions, IEEE (Revista IEEE America Latina)
Publisher
ieee
ISSN
1548-0992
Type
jour
DOI
10.1109/TLA.2014.6868880
Filename
6868880
Link To Document