• DocumentCode
    1864537
  • Title

    Microphone-array speech recognition via incremental map training

  • Author

    Adcock, John E. ; Gotoh, Yoshihiko ; Mashao, Daniel J. ; Silverman, Hawey F.

  • Author_Institution
    LEMS, Brown Univ., Providence, RI, USA
  • Volume
    2
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    897
  • Abstract
    For a hidden Markov model (HMM) based speech recognition system it is desirable to combine enhancement of the acoustical signal and statistical representation of model parameters, ensuring both a high quality speech signal and an appropriately trained HMM. In this paper the incremental variant of maximum a posteriori (MAP) estimation is used to adjust the parameters of a talker-independent HMM-based speech recognition system to accurately recognize speech data acquired with a microphone-array. The approach is novel for a microphone-array speech recognition task in that a robust talker-independent model is derived from a baseline system using a relatively small amount of data for training. The results show that (1) ILIAP training significantly improves recognition performance compared to the baseline, and (2) beamforming signal enhancement outperforms single-channel enhancement before and after the adaptive MAP training
  • Keywords
    acoustic signal processing; acoustic transducer arrays; array signal processing; hidden Markov models; maximum likelihood estimation; microphones; speech enhancement; speech recognition; ILIAP training; MAP estimation; beamforming signal enhancement; enhancement; hidden Markov model; incremental map training; maximum a posteriori estimation; microphone-array speech recognition; robust talker-independent model; statistical representation; talker-independent HMM-based speech recognition system; Acoustical engineering; Array signal processing; Databases; Electronic mail; Hidden Markov models; Microphone arrays; Robustness; Speech enhancement; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.543266
  • Filename
    543266