• DocumentCode
    2714335
  • Title

    Hierarchical subband linear predictive cepstral (HSLPC) features for HMM-based speech recognition

  • Author

    Chengalvarayan, Rathinavelu

  • Author_Institution
    Speech Process. Group, Lucent Technol., Naperville, IL, USA
  • Volume
    1
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    409
  • Abstract
    A new approach for linear prediction (LP) analysis is explored, where predictor can be computed from a mel-warped subband-based autocorrelation functions obtained from the power spectrum. For spectral representation a set of multi-resolution cepstral features are proposed. The general idea is to divide up the full frequency-band into several subbands, perform the IDFT on the mel power spectrum for each subband, followed by Durbin´s algorithm and the standard conversion from LP to cepstral coefficients. This approach can be extended to several levels of different resolutions. Multi-resolution feature vectors, formed by concatenation of the subband cepstral features into an extended feature vector, are shown to yield better performance than the conventional mel-warped LPCCs over the full voice-bandwidth for a connected digit recognition task
  • Keywords
    correlation methods; discrete Fourier transforms; feature extraction; hidden Markov models; inverse problems; prediction theory; signal representation; signal resolution; spectral analysis; speech recognition; Durbin´s algorithm; HMM-based speech recognition; IDFT; LP coefficients; cepstral coefficients; connected digit recognition task; frequency-band; hierarchical subband linear predictive cepstral features; linear prediction analysis; mel power spectrum; mel-warped subband-based autocorrelation functions; multi-resolution cepstral features; multi-resolution feature vectors; performance; spectral representation; subband cepstral features; voice-bandwidth; Cepstral analysis; Computer interfaces; Harmonic analysis; Hidden Markov models; Pattern analysis; Signal analysis; Speech analysis; Speech processing; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758149
  • Filename
    758149