DocumentCode
2943146
Title
Structure optimization of metamodels to improve speech recognition accuracy
Author
Morales, Santiago Omar Caballero
Author_Institution
Postgrad. Div., Technol. Univ. of the Mixtec Region, Huajuapan de León, Mexico
fYear
2011
fDate
Feb. 28 2011-March 2 2011
Firstpage
125
Lastpage
130
Abstract
The metamodels is a technique that was developed to model a speaker´s phoneme confusion-matrix and use this information to increase speech recognition accuracy for speakers with disordered and normal speech. Approaches to improve the performance of the metamodels, mainly focused on obtaining better estimates of the speaker´s confusion-matrix, were studied. While some achieved significant improvements, alternatives to the functional structure of the metamodels were not explored. In this paper is proposed a different structure for the metamodel of a phoneme and its optimization by means of a genetic algorithm. Results showed statistically significant gains in speech recognition accuracy over the previous metamodels.
Keywords
genetic algorithms; speech recognition; genetic algorithm; metamodels; speaker phoneme confusion-matrix; speech recognition; structure optimization; Accuracy; Context; Gallium; Hidden Markov models; Optimization; Speech recognition; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical Communications and Computers (CONIELECOMP), 2011 21st International Conference on
Conference_Location
San Andres Cholula
Print_ISBN
978-1-4244-9558-0
Type
conf
DOI
10.1109/CONIELECOMP.2011.5749348
Filename
5749348
Link To Document