• DocumentCode
    1544785
  • Title

    Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition

  • Author

    Vergin, Rivarol ; O´Shaughnessy, Douglas ; Farhat, Azarshid

  • Author_Institution
    INRS Telecommun., Ile des Soeurs, Que., Canada
  • Volume
    7
  • Issue
    5
  • fYear
    1999
  • fDate
    9/1/1999 12:00:00 AM
  • Firstpage
    525
  • Lastpage
    532
  • Abstract
    The focus of a continuous speech recognition process is to match an input signal with a set of words or sentences according to some optimality criteria. The first step of this process is parameterization, whose major task is data reduction by converting the input signal into parameters while preserving virtually all of the speech signal information dealing with the text message. This contribution presents a detailed analysis of a widely used set of parameters, the mel frequency cepstral coefficients (MFCCs), and suggests a new parameterization approach taking into account the whole energy zone in the spectrum. Results obtained with the proposed new coefficients give a confidence interval about their use in a large-vocabulary speaker-independent continuous-speech recognition system
  • Keywords
    cepstral analysis; parameter estimation; speech recognition; confidence interval; energy zone; generalized mel frequency cepstral coefficients; input signal; interpolation; large-vocabulary continuous-speech recognition; optimality criteria; parameters; sentences; speaker-independent continuous-speech recognition; spectrum; text message; words; Acoustic applications; Cepstral analysis; Data mining; Discrete Fourier transforms; Filters; Linear predictive coding; Mel frequency cepstral coefficient; Signal processing; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.784104
  • Filename
    784104