DocumentCode
3639954
Title
Digit Recognition in the Náhuatl Language: An Evaluation Using Various Recognition Models
Author
Sergio Suarez-Guerra;Jose Luis Oropeza-Rodriguez;Juan Carlos Flores-Paulin;Luis Pastor Sanchez-Fernandez
fYear
2010
Firstpage
143
Lastpage
147
Abstract
The aim of Automatic Speech Recognition (ASR) is to develop techniques and systems that enable a computer to accept speech input. The digit recognition task has been often employed contributing to the ASR. In this work, we used parameters of Lineal Prediction Codes (LPC) and Mel Frequency Cepstrum Coefficients (MFCCs). For selection of the best analysis interval we used a Vector Quantization Model. For recognition, we applied the Continuous Density Hidden Markov Model (CDHHM), which employed a dictionary conformed of eighteen command words that are specific digits from the Náhuatl language. The obtained results were compared using Discrete Hidden Markov Models and Vector Quantization Models. In this experiment, we obtained a performance of 99% accuracy for digit recognition. In our experiments we used three native speakers.
Keywords
"Hidden Markov models","Speech recognition","Speech","Training","Viterbi algorithm","Mel frequency cepstral coefficient","Vector quantization"
Publisher
ieee
Conference_Titel
Artificial Intelligence (MICAI), 2010 Ninth Mexican International Conference on
Print_ISBN
978-1-4244-9246-6;978-0-7695-4284-3
Type
conf
DOI
10.1109/MICAI.2010.27
Filename
5699185
Link To Document