• DocumentCode
    3229544
  • Title

    A successive state splitting algorithm for efficient allophone modeling

  • Author

    Takami, Jun-Ichi ; Sagayama, Shigeki

  • Author_Institution
    ATR Interpreting Telephone Res. Lab., Kyoto, Japan
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    573
  • Abstract
    The authors propose an algorithm, successive state splitting (SSS), for simultaneously finding an optimal set of phoneme context classes, an optimal topology, and optimal parameters for hidden Markov models (HMMs) commonly using a maximum likelihood criterion. With this algorithm, a hidden Markov network (HM-Net), which is an efficient representation of phoneme-context-dependent HMMs, can be generated automatically. The authors implemented this algorithm, and tested it on the recognition of six Japanese consonants (|b|, |d|, |g|, |m|, |n| and |N|). The HM-Net gave better recognition results with a lower number of total output probability density distributions than conventional phoneme-context-independent mixture Gaussian density HMMs
  • Keywords
    hidden Markov models; speech recognition; HMM; Japanese consonants; allophone modeling; hidden Markov models; hidden Markov network; maximum likelihood criterion; optimal parameters; optimal topology; output probability density distributions; phoneme context classes; speech recognition; successive state splitting algorithm; Acoustic distortion; Context modeling; Hidden Markov models; Intelligent networks; Laboratories; Network topology; Robustness; Speech recognition; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225855
  • Filename
    225855