• DocumentCode
    5806
  • Title

    Low-Cost Speaker and Language Recognition Systems Running on a Raspberry Pi

  • Author

    Haro, Luis Fernando D. ; Cordoba, Ricardo ; Rojo Rivero, Jose Ignacio ; Diez de la Fuente, Jorge ; Avendano Peces, Diego ; Bermudo Mera, Jose Maria

  • Author_Institution
    Dept. de Ing. Electron., Univ. Politec. de Madrid, Madrid, Spain
  • Volume
    12
  • Issue
    4
  • fYear
    2014
  • fDate
    Jun-14
  • Firstpage
    755
  • Lastpage
    763
  • Abstract
    This paper describes two state-of-the-art, portable voice-based systems: one for speaker authentication and one for language recognition. The authentication system allows secure access to a home media center, while the language recognition system can serve as a preliminary step before automatically transcribing speech and translating the recognized text from its original language into another. The main advantage of the developed systems is that they run on a low-cost embedded device, such as a Raspberry Pi (RPi), using only open-source projects, which makes them easy to replicate or integrate into other systems and suitable for educational projects in electronics. The developed systems have been tested on real data with very good results. The authentication system completes validation in 3.3 seconds on average, with an equal error rate (EER) of 19% on 20-second test files, and was tested with up to 87 different speakers. The language recognition system recognizes up to six languages. For this system, considerable effort went into reducing processing time and memory requirements while keeping the recognition rate high. The final system uses 64 Gaussians and 200 i-vectors, obtaining a Cavg error rate of 8.6% for the six languages.
  • Keywords
    natural language processing; speaker recognition; Raspberry Pi; authentication system; language recognition systems; low cost speaker; open source projects; voice based authentication; Androids; Economic indicators; Humanoid robots; Media; Multimedia communication; Robustness; Software; Language recognition; Speaker recognition; embedded devices; i-vectors; open-source tools;
  • fLanguage
    English
  • Journal_Title
    Latin America Transactions, IEEE (Revista IEEE America Latina)
  • Publisher
    IEEE
  • ISSN
    1548-0992
  • Type

    jour

  • DOI
    10.1109/TLA.2014.6868880
  • Filename
    6868880