• DocumentCode
    3339329
  • Title
    Robust continuous speech recognition system based on a microphone array
  • Author
    Lleida, E. ; Fernández, J. ; Masgrau, E.
  • Author_Institution
    Dept. Electron. & Commun. Eng., Zaragoza Univ., Spain
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    241
  • Abstract
    A robust speech recognition system for videoconference applications, based on a microphone array, is presented. The microphone array allows the recognition system to locate the users and to increase the signal-to-noise ratio (SNR) between the desired speaker's signal and the interference from the other users. User positions are estimated by combining a direction-of-arrival (DOA) estimation method with a speaker identification system. Beamforming uses the spatial references of the desired speaker and of the interference locations. A minimum variance algorithm with spatial constraints, working in the frequency domain, is used to design the weights of the broadband microphone array. Results are reported for a simulated environment in which several users put questions to a geographic database.
  • Keywords
    acoustic transducer arrays; array signal processing; direction-of-arrival estimation; interference (signal); microphones; speech processing; speech recognition; teleconferencing; DOA estimation; SNR; broadband microphone array; desired speaker signal; direction of arrival; frequency domain; frequency-domain beamforming; geographic data base; hidden Markov models; interference locations; microphone array; minimum variance algorithm; robust continuous speech recognition system; signal-to-noise ratio; simulated environment; spatial constraints; spatial references; speaker identification system; user position estimation; videoconference applications; weights; Algorithm design and analysis; Array signal processing; Direction of arrival estimation; Frequency domain analysis; Interference constraints; Microphone arrays; Robustness; Signal to noise ratio; Speech recognition; Videoconference
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type
    conf
  • DOI
    10.1109/ICASSP.1998.674412
  • Filename
    674412