• DocumentCode
    3124870
  • Title

    TDOA information based vad for robust speech recognition in directional and diffuse noise field

  • Author

    Kuan-Lang Huang ; Tai-Shih Chi

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    126
  • Lastpage
    130
  • Abstract
    A two-microphone algorithm is proposed to improve automatic speech recognition (ASR) rates when target speech is corrupted by directional interferences and diffuse noise simultaneously. The algorithm adopts the time difference of arrival (TDOA) to suppress directional interferences and a TDOA-information based voice activity detector (VAD) to suppress diffuse noise. Simulation results show the proposed algorithm is effective in improving ASR rates in a sound field mixed with a directional interference and diffuse noise. Compared with the phase difference (PD) algorithm, the proposed method gives comparable recognition rates when facing a directional interference and much higher and more robust recognition rates when diffuse noise emerges.
  • Keywords
    audio signal processing; interference (signal); microphone arrays; noise abatement; speech recognition; time-of-arrival estimation; ASR rate improvement; TDOA-information based voice activity detector; VAD; automatic speech recognition rate improvement; diffuse noise suppression; directional interference suppression; sound field; target speech; time difference-of-arrival; two-microphone algorithm; Databases; Interference; Microphone arrays; Noise; Speech; Speech recognition; Diffuse noise; directional interference; phase difference; time difference of arrival; voice activity detector;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423514
  • Filename
    6423514