Title :
Bidirectional OM-LSA speech estimator for noise robust speech recognition
Author :
Obuchi, Yasunari ; Takeda, Ryu ; Togami, Masahito
Author_Institution :
Central Res. Lab., Hitachi Ltd., Tokyo, Japan
Abstract :
A new speech enhancement method using a bidirectional speech estimator is introduced. The widely known speech enhancement method based on the optimally-modified log spectral amplitude (OM-LSA) speech estimator is modified further under the assumption that frame-synchronous estimation is not essential in some speech recognition applications. The new method uses two separate flows of speech gain estimation, one running forward in time and the other backward. A simple look-ahead estimation mechanism is also implemented in each flow. Averaging the two gains makes the speech estimation more robust under various noise conditions. Evaluation experiments using artificial and real noisy speech data confirm that the proposed method greatly improves speech recognition accuracy.
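The forward/backward averaging idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the per-direction gain below is a simple Wiener-style gain with a decision-directed a priori SNR, used as a stand-in for the OM-LSA gain, and the look-ahead mechanism is omitted. The function names, the parameters `alpha` and `floor`, and the assumption of a known, stationary noise power are all hypothetical choices made for illustration.

```python
import numpy as np

def directional_gain(power, noise_power, alpha=0.9, floor=0.1):
    """Recursive per-frame gain along the given frame order.

    power:       (frames, bins) noisy power spectrogram
    noise_power: (bins,) noise power, assumed known and stationary
    NOTE: Wiener gain with decision-directed SNR, a stand-in for OM-LSA.
    """
    n_frames, n_bins = power.shape
    gains = np.empty_like(power, dtype=float)
    prev_clean = np.zeros(n_bins)  # clean-speech power estimated at the previous frame
    for t in range(n_frames):
        post_snr = power[t] / noise_power
        # Decision-directed a priori SNR: blend the previous estimate with the current one.
        prio_snr = (alpha * prev_clean / noise_power
                    + (1.0 - alpha) * np.maximum(post_snr - 1.0, 0.0))
        g = np.maximum(prio_snr / (1.0 + prio_snr), floor)
        gains[t] = g
        prev_clean = (g ** 2) * power[t]
    return gains

def bidirectional_gain(power, noise_power, **kwargs):
    """Average of the gains estimated forward and backward in time."""
    forward = directional_gain(power, noise_power, **kwargs)
    # Run the same estimator on the time-reversed frames, then restore the order.
    backward = directional_gain(power[::-1], noise_power, **kwargs)[::-1]
    return 0.5 * (forward + backward)
```

Because the backward flow needs the whole utterance before it can start, a scheme like this only fits applications where frame-synchronous output is not required, which is the assumption the abstract states.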
Keywords :
optimisation; speech recognition; bidirectional OM-LSA speech estimator; frame-synchronous estimation; look-ahead estimation mechanism; noise robust speech recognition; noisy speech data; optimally-modified log spectral amplitude speech estimator; speech enhancement method; speech gain estimation; Noise measurement; Signal to noise ratio; Solids; Speech; Speech enhancement; Speech recognition; Training;
Conference_Title :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
DOI :
10.1109/ASRU.2011.6163926