• DocumentCode
    417141
  • Title

    Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approach

  • Author

    Selouani, S.A. ; O´Shaughnessy, D.

  • Author_Institution
    Univ. de Moncton, Campus De Shippagan, Canada
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The paper presents a method to compensate Mel-frequency cepstral coefficients (MFCCs) for a HMM-based speech recognition system evolving under telephone-channel degradations. The technique we propose is based on the combination of the Karhonen-Loeve transform (KLT) and genetic algorithms (GA). The idea consists of projecting the band-limited MFCCs onto a subspace generated by the genetically optimized KLT principal axes. Experiments show a clear improvement when the method is applied to the NTIMIT telephone speech database. Word recognition results obtained on the HTK toolkit platform using N-mixture triphone models and a bigram language model are presented and discussed.
  • Keywords
    Karhunen-Loeve transforms; genetic algorithms; hidden Markov models; speech recognition; telephony; HMM; Karhonen-Loeve transform; MFCC; Mel-cepstral subspace approach; Mel-frequency cepstral coefficients; bigram language model; genetic algorithms; speech recognition robustness; telephone-channel degradation; triphone models; word recognition; Acoustic testing; Additive noise; Degradation; Genetic algorithms; Karhunen-Loeve transforms; Noise generators; Robustness; Speech recognition; Telephony; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325957
  • Filename
    1325957