• DocumentCode
    294594
  • Title

    Use of generalized dynamic feature parameters for speech recognition: maximum likelihood and minimum classification error approaches

  • Author

    Rathinavelu, C. ; Deng, L.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    373
  • Abstract
    In this study we implemented a speech recognizer based on the integrated view, proposed first by Deng (see IEEE Signal Processing Letters, vol.1, no.4, p.66-69, 1994), on the speech preprocessing and speech modeling problems in the recognizer design. The integrated model we developed generalizes the conventional, currently widely used delta-parameter technique, which has been confined strictly to the preprocessing domain only, in two significant ways. First, the new model contains state-dependent weighting functions responsible for transforming static speech features into the dynamic ones in a slowly time-varying manner. Second, novel maximum-likelihood and minimum-classification-error based learning algorithms are developed for the model that allows joint optimization of the state-dependent weighting functions and the remaining conventional HMM parameters. The experimental results obtained from a standard TIMIT phonetic classification task provide preliminary evidence for the effectiveness of our new, general approaches to the use of the dynamic characteristics of speech spectra
  • Keywords
    error analysis; hidden Markov models; maximum likelihood estimation; spectral analysis; speech processing; speech recognition; HMM parameters; TIMIT phonetic classification task; delta-parameter technique; dynamic characteristics; dynamic speech features; experimental results; generalized dynamic feature parameters; integrated model; learning algorithms; maximum likelihood classification error; minimum classification error; recognizer design; speech modeling; speech preprocessing; speech recognition; speech recognizer; speech spectra; state-dependent weighting functions; static speech features; Computer errors; Design optimization; Hidden Markov models; Speech enhancement; Speech processing; Speech recognition; Vectors; Yttrium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479599
  • Filename
    479599