• DocumentCode
    2155794
  • Title

    Spectrum-entropy based beam-former with speaker tracking for hands-free continuous speech recognition in noise

  • Author

    George, Nokas ; Evangelos, Dermatas

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Patras Univ., Greece
  • Volume
    1
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    251
  • Abstract
    In hands-free speech recognition of moving speakers, the time interval where the source position can be assumed stationary varies. It is very common for the speaker to move rapidly within the data window exploited. In such cases the conventional fixed-window direction of arrival (DOA) estimation may lead to poor tracking performance. In this paper we present a novel speech beamformer for moving speakers in noisy environments. The localization algorithm extracts a set of candidate DOA of the signal sources using array signal processing methods in the frequency domain. A minimum variance (MV) beamformer identifies the speech signal DOA in the direction where the signal´s spectrum entropy is minimized. The same localization algorithm is used to detect the closest direction to the initial estimation using a smaller window. The proposed method is evaluated using a phoneme recognition system and noise recordings from an air-condition fan and the TIMIT speech corpus. Extended experiments, carried out in the range of 25-0 dB SNR, show significant improvement in the recognition rate of moving speakers especially in very low SNR.
  • Keywords
    array signal processing; direction-of-arrival estimation; frequency-domain analysis; identification; minimum entropy methods; spectral analysis; speech processing; speech recognition; DOA estimation; array signal processing; direction of arrival estimation; frequency domain; hands-free continuous speech recognition; identification; localization algorithm; minimum variance beamformer; moving speakers; noise recordings; noisy environments; phoneme recognition system; speaker tracking; spectrum entropy minimization; speech signal; Array signal processing; Data mining; Direction of arrival estimation; Entropy; Frequency domain analysis; Signal processing; Signal processing algorithms; Signal to noise ratio; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
  • Print_ISBN
    0-7803-7503-3
  • Type

    conf

  • DOI
    10.1109/ICDSP.2002.1027881
  • Filename
    1027881