DocumentCode
2155794
Title
Spectrum-entropy based beam-former with speaker tracking for hands-free continuous speech recognition in noise
Author
George, Nokas ; Evangelos, Dermatas
Author_Institution
Dept. of Electr. & Comput. Eng., Patras Univ., Greece
Volume
1
fYear
2002
fDate
2002
Firstpage
251
Abstract
In hands-free speech recognition of moving speakers, the time interval where the source position can be assumed stationary varies. It is very common for the speaker to move rapidly within the data window exploited. In such cases the conventional fixed-window direction of arrival (DOA) estimation may lead to poor tracking performance. In this paper we present a novel speech beamformer for moving speakers in noisy environments. The localization algorithm extracts a set of candidate DOA of the signal sources using array signal processing methods in the frequency domain. A minimum variance (MV) beamformer identifies the speech signal DOA in the direction where the signal´s spectrum entropy is minimized. The same localization algorithm is used to detect the closest direction to the initial estimation using a smaller window. The proposed method is evaluated using a phoneme recognition system and noise recordings from an air-condition fan and the TIMIT speech corpus. Extended experiments, carried out in the range of 25-0 dB SNR, show significant improvement in the recognition rate of moving speakers especially in very low SNR.
Keywords
array signal processing; direction-of-arrival estimation; frequency-domain analysis; identification; minimum entropy methods; spectral analysis; speech processing; speech recognition; DOA estimation; array signal processing; direction of arrival estimation; frequency domain; hands-free continuous speech recognition; identification; localization algorithm; minimum variance beamformer; moving speakers; noise recordings; noisy environments; phoneme recognition system; speaker tracking; spectrum entropy minimization; speech signal; Array signal processing; Data mining; Direction of arrival estimation; Entropy; Frequency domain analysis; Signal processing; Signal processing algorithms; Signal to noise ratio; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
Print_ISBN
0-7803-7503-3
Type
conf
DOI
10.1109/ICDSP.2002.1027881
Filename
1027881
Link To Document