Title :
Robust speech recognition in unknown reverberant and noisy conditions
Author :
Roger Hsiao;Jeff Ma;William Hartmann;Martin Karafi?t;Franti?ek Gr?zl;Luk?? Burget;Igor Sz?ke;Jan Honza ?ernock?;Shinji Watanabe;Zhuo Chen;Sri Harish Mallidi;Hynek Hermansk?;Stavros Tsakalidis;Richard Schwartz
Author_Institution :
Raytheon BBN Technologies, Cambridge, MA, USA
Abstract :
In this paper, we describe our work on the ASpIRE (Automatic Speech recognition In Reverberant Environments) challenge, which aims to assess the robustness of automatic speech recognition (ASR) systems. The main characteristic of the challenge is developing a high-performance system without access to matched training and development data. While the evaluation data are recorded with far-field microphones in noisy and reverberant rooms, the training data are telephone speech and close talking. Our approach to this challenge includes speech enhancement, neural network methods and acoustic model adaptation, We show that these techniques can successfully alleviate the performance degradation due to noisy audio and data mismatch.
Keywords :
"Training","Speech","Artificial neural networks","Noise measurement","Adaptation models","Acoustics","Robustness"
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on
DOI :
10.1109/ASRU.2015.7404841