• DocumentCode
    2943146
  • Title

    Structure optimization of metamodels to improve speech recognition accuracy

  • Author

    Morales, Santiago Omar Caballero

  • Author_Institution
    Postgrad. Div., Technol. Univ. of the Mixtec Region, Huajuapan de León, Mexico
  • fYear
    2011
  • fDate
    Feb. 28 2011-March 2 2011
  • Firstpage
    125
  • Lastpage
    130
  • Abstract
    The metamodels is a technique that was developed to model a speaker´s phoneme confusion-matrix and use this information to increase speech recognition accuracy for speakers with disordered and normal speech. Approaches to improve the performance of the metamodels, mainly focused on obtaining better estimates of the speaker´s confusion-matrix, were studied. While some achieved significant improvements, alternatives to the functional structure of the metamodels were not explored. In this paper is proposed a different structure for the metamodel of a phoneme and its optimization by means of a genetic algorithm. Results showed statistically significant gains in speech recognition accuracy over the previous metamodels.
  • Keywords
    genetic algorithms; speech recognition; genetic algorithm; metamodels; speaker phoneme confusion-matrix; speech recognition; structure optimization; Accuracy; Context; Gallium; Hidden Markov models; Optimization; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Communications and Computers (CONIELECOMP), 2011 21st International Conference on
  • Conference_Location
    San Andres Cholula
  • Print_ISBN
    978-1-4244-9558-0
  • Type

    conf

  • DOI
    10.1109/CONIELECOMP.2011.5749348
  • Filename
    5749348