• DocumentCode
    302076
  • Title

    Knowledge-based parameters for HMM speech recognition

  • Author

    Bitar, Nabil N. ; Espy-Wilson, Carol Y.

  • Author_Institution
    Dept. of Electr. Comput. & Syst. Eng., Boston Univ., MA, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    29
  • Abstract
    This paper presents acoustic parameters (APs) that were motivated by phonetic feature theory and employed as a signal representation of speech in a hidden Markov model (HMM) recognition framework. Presently, the phonetic features considered are the manner features: sonorant, syllabic, nonsyllabic, noncontinuant and fricated. The objective of the parameters is to directly target the linguistic information in the signal and to reduce the speaker-dependent information that may yield large speech variability. To achieve these goals, the APs were defined in a relational manner across time or frequency. For evaluation, broad-class recognition experiments were conducted comparing the APs to cepstral-based parameters. The results of the experiments indicate that the APs are able to capture the phonetically relevant information in the speech signal and that, in comparison to the cepstral-based parameters, they are more able to reduce the interspeaker variability
  • Keywords
    acoustic signal processing; hidden Markov models; knowledge based systems; linguistics; parameter estimation; signal representation; speech processing; speech recognition; HMM; HMM speech recognition; acoustic parameters; cepstral based parameters; fricated features; hidden Markov model; interspeaker variability reduction; knowledge based parameters; linguistic information; manner features; noncontinuant features; nonsyllabic features; phonetic feature theory; sonorant features; speaker-dependent information reduction; speech recognition experiments; speech signal representation; syllabic features; Acoustical engineering; Context modeling; Frequency; Hidden Markov models; Natural languages; Signal representations; Speech processing; Speech recognition; Systems engineering and theory; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.540282
  • Filename
    540282