• DocumentCode
    2023077
  • Title

    A supervised approach to the construction of context-sensitive acoustic prototypes

  • Author

    Bellegarda, Jerome R. ; De Souza, Peter V. ; Nahamoo, David ; Picheny, Michael A. ; Bahl, Lalit R.

  • Author_Institution
    IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    2
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    644
  • Abstract
    The authors describe a supervised approach to the construction of context-sensitive acoustic prototypes for use in speech recognition systems using allophonic subword hidden Markov models (HMMs). A properly fine partition of the underlying acoustic space(s) is achieved by incorporating contextual supervision to relate the HMM allophonic models to their acoustic manifestations. By decoupling the overall procedure into a clustering phase followed by a pruning phase, it becomes possible to uncover satisfactorily once and for all the general interrelationships between various acoustic subevents, while customizing the acoustic prototypes themselves according to the available training data. This makes for a more efficient utilization of the training sentences, as evidenced by a substantial reduction in the error rate with respect to a baseline system not taking advantage of supervision. The performance of this method is illustrated on an isolated utterance speech recognition task with a vocabulary of 20000 words.<>
  • Keywords
    context-sensitive languages; hidden Markov models; learning (artificial intelligence); speech recognition; HMM; acoustic subevents; allophonic subword hidden Markov models; clustering phase; context-sensitive acoustic prototypes; contextual supervision; error rate; performance; pruning phase; speech recognition systems; training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319392
  • Filename
    319392