مرکز منطقه ای اطلاع رساني علوم و فناوري - Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition

DocumentCode :

179245

Title :

Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition

Author :

Weninger, Felix ; Watanabe, Shigetaka ; Tachioka, Yuuki ; Schuller, Bjorn

Author_Institution :

Mitsubishi Electr. Res. Labs. (MERL), Cambridge, MA, USA

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

4623

Lastpage :

4627

Abstract :

This paper describes our joint efforts to provide robust automatic speech recognition (ASR) for reverberated environments, such as in hands-free human-machine interaction. We investigate blind feature space de-reverberation and deep recurrent de-noising auto-encoders (DAE) in an early fusion scheme. Results on the 2014 REVERB Challenge development set indicate that the DAE front-end provides complementary performance gains to multi-condition training, feature transformations, and model adaptation. The proposed ASR system achieves word error rates of 17.62 % and 36.6 % on simulated and real data, which is a significant improvement over the Challenge baseline (25.16 and 47.2 %).

Keywords :

feature extraction; recurrent neural nets; reverberation; signal denoising; speech codecs; speech recognition; ASR; automatic speech recognition; blind dereverberation; blind feature space dereverberation; deep recurrent denoising auto-encoder; feature transformations; fusion scheme; hands-free human-machine interaction; model adaptation; multi-condition training; reverberated environments; reverberated speech recognition; word error rates; Adaptation models; Noise reduction; Reverberation; Speech; Speech recognition; Training; De-reverberation; automatic speech recognition; feature enhancement; recurrent neural networks;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6854478

Filename :

6854478

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=179245