DocumentCode :
3485109
Title :
Bidirectional OM-LSA speech estimator for noise robust speech recognition
Author :
Obuchi, Yasunari ; Takeda, Ryu ; Togami, Masahito
Author_Institution :
Central Res. Lab., Hitachi Ltd., Tokyo, Japan
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
173
Lastpage :
178
Abstract :
A new speech enhancement method using bidirectional speech estimator is introduced. A widely-known speech enhancement method using the optimally-modified log spectral amplitude (OM-LSA) speech estimator is re-modified under the assumption that the frame-synchronous estimation is not essential in some of the speech recognition applications. The new method utilizes two separate flows of the speech gain estimation, one is along the forward direction of time and the other along the backward direction. A simple look-ahead estimation mechanism is also implemented in each flow. By taking the average of these two gains, the speech estimation becomes more robust under various noise conditions. Evaluation experiments using the artificial and real noisy speech data confirm that the speech recognition accuracy can be greatly improved by the proposed method.
Keywords :
optimisation; speech recognition; bidirectional OM-LSA speech estimator; frame-synchronous estimation; look-ahead estimation mechanism; noise robust speech recognition; noisy speech data; optimally-modified log spectral amplitude speech estimator; speech enhancement method; speech gain estimation; Noise measurement; Signal to noise ratio; Solids; Speech; Speech enhancement; Speech recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163926
Filename :
6163926
Link To Document :
بازگشت