• DocumentCode
    810018
  • Title
    Modelling speech signals using formant frequencies as an intermediate representation
  • Author
    Russell, M.J.; Zheng, X.; Jackson, P.J.B.
  • Author_Institution
    Dept. of Electron., Univ. of Birmingham
  • Volume
    1
  • Issue
    1
  • fYear
    2007
  • fDate
    3/1/2007 12:00:00 AM
  • Firstpage
    43
  • Lastpage
    50
  • Abstract
    Multiple-level segmental hidden Markov models (M-SHMMs) in which the relationship between symbolic and acoustic representations of speech is regulated by a formant-based intermediate representation are considered. New TIMIT phone recognition results are presented, confirming that the theoretical upper bound on performance is achieved provided that either the intermediate representation or the formant-to-acoustic mapping is sufficiently rich. The way in which M-SHMMs exploit formant-based information is also investigated, using singular value decomposition of the formant-to-acoustic mappings and linear discriminant analysis. The analysis shows that if the intermediate layer contains information which is linearly related to the spectral representation, that information is used in preference to explicit formant frequencies, even though the latter are useful for phone discrimination. In summary, although these results confirm the utility of M-SHMMs for automatic speech recognition, they provide empirical evidence of the value of nonlinear formant-to-acoustic mappings.
  • Keywords
    acoustic signal processing; hidden Markov models; singular value decomposition; speech recognition; automatic speech recognition; formant frequencies; formant-based intermediate representation; formant-to-acoustic mapping; linear discriminant analysis; multiple-level segmental hidden Markov models; phone recognition; spectral representation; speech signal modelling
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IET
  • Publisher
    IET
  • ISSN
    1751-9675
  • Type
    jour
  • DOI
    10.1049/iet-spr:20060179
  • Filename
    4159615