• DocumentCode
    3111747
  • Title

    Microphone array signal processing for far-talking speech recognition

  • Author

    Chien, Jen-Tzung ; Lai, Jain-Ray ; Lai, Po-Yin

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    322
  • Lastpage
    325
  • Abstract
    This paper presents a combined microphone array and model adaptation algorithm for distant speech recognition. We aim at resolving the inconvenience of using a head-mounted/hand-holding microphone in a conventional speech recognizer. To improve the distant speech quality, a linear microphone array is applied and acts as a robust acquisition system. We develop a time-domain coherence measure (TDCM) to precisely detect the time delay of speech signals collected by different microphones. The estimated delay is adopted in a delay-and-sum beamformer for speech enhancement. Further, we adapt the speech hidden Markov models to get close to the acoustic condition of enhanced test speech for robust speech recognition. In acquisition and recognition experiments on connected Chinese digits, we find that TDCM can estimate the time delay as precisely as that calculated assuming the speech source direction is known. Increase of speech sampling rate is helpful to determine time delay. Also, the incorporation of the model adaptation scheme can significantly reduce the recognition errors with moderate computation overhead
  • Keywords
    array signal processing; delay estimation; hidden Markov models; microphones; signal sampling; speech enhancement; speech recognition; connected Chinese digits; delay-and-sum beamformer; distant speech recognition; far-talking speech recognition; hidden Markov models; linear microphone array; microphone array signal processing; model adaptation algorithm; robust acquisition system; speech enhancement; speech sampling rate; time delay estimation; time-domain coherence measure; Adaptation model; Array signal processing; Delay effects; Delay estimation; Microphone arrays; Robustness; Signal processing algorithms; Speech enhancement; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications, 2001. (SPAWC '01). 2001 IEEE Third Workshop on Signal Processing Advances in
  • Conference_Location
    Taiwan
  • Print_ISBN
    0-7803-6720-0
  • Type

    conf

  • DOI
    10.1109/SPAWC.2001.923915
  • Filename
    923915