• DocumentCode
    1343451
  • Title

    Linear trajectory models incorporating preprocessing parameters for speech recognition

  • Author

    Chengalvarayan, Rathinavelu

  • Author_Institution
    Lucent Technol., Bell Labs., Naperville, IL, USA
  • Volume
    5
  • Issue
    3
  • fYear
    1998
  • fDate
    3/1/1998 12:00:00 AM
  • Firstpage
    66
  • Lastpage
    68
  • Abstract
    In this letter, we investigate the interactions of front-end feature extraction and back-end classification techniques in nonstationary state hidden Markov model (NSHMM) based speech recognition. The proposed model aims at finding an optimal linear transformation on the mel-warped discrete Fourier transform (DFT) features according to the minimum classification error (MCE) criterion. This linear transformation, along with the NSHMM parameters, are automatically trained using the gradient descent method. An error rate reduction of 8% is obtained on a standard 39-class TIMIT phone classification task in comparison with the MCE-trained NSHMM using conventional preprocessing techniques.
  • Keywords
    discrete Fourier transforms; feature extraction; hidden Markov models; pattern classification; speech recognition; DFT features; HMM; TIMIT phone classification task; back-end classification; error rate reduction; front-end feature extraction; gradient descent method; linear trajectory models; mel-warped discrete Fourier transform; minimum classification error criterion; nonstationary state hidden Markov model; optimal linear transformation; preprocessing parameters; speech recognition; Cepstral analysis; Context modeling; Discrete Fourier transforms; Discrete cosine transforms; Error analysis; Feature extraction; Hidden Markov models; Speech recognition; Vectors;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/97.661563
  • Filename
    661563