DocumentCode
542320
Title
Hands-free continuous speech recognition in noise using a speaker beam-former based on spectrum-entropy
Author
Nokas, George; Dermatas, Evangelos
Author_Institution
Department of Electrical & Computer Engineering, University of Patras, 26500, Hellas, Greece
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
Detecting the speaker position is a crucial task in hands-free speech recognition applications. In this paper we present a novel speech beam-former for noisy environments. First, the localization algorithm extracts a set of candidate directions of the signal sources using array signal processing methods in the frequency domain. Then, a minimum variance (MV) beam-former identifies the speech signal as the one arriving from the direction in which the signal's spectrum entropy is minimized. The proposed method is evaluated with a phoneme recognition system, using noise recordings from an air-conditioner fan and the TIMIT speech corpus. Extensive experiments, carried out over an SNR range of 25–0 dB, show almost perfect estimation of the speaker direction of arrival (DOA) in all cases. As a consequence, the recognition rate increases significantly compared with the rate obtained using a single microphone. The improvement is especially pronounced at very low SNRs.
Keywords
Arrays; Entropy; Hidden Markov models; Robustness; Speech; Speech recognition; Three dimensional displays
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743882
Filename
5743882