DocumentCode
302074
Title
Robust distant-talking speech recognition
Author
Pearson, J. ; Lin, Q. ; Che, C. ; Yuk, D.-S. ; Jin, L. ; DeVries, Ben ; Flanagan, J.
Author_Institution
CAIP Center, Rutgers Univ., Piscataway, NJ, USA
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
21
Abstract
Most contemporary speech recognizers are designed to operate with close-talking speech and they work best in a quiet laboratory condition. There is an apparent need to render environment robustness to these systems. The objective of the paper is to explore utility of existing speech recognition technology in adverse “real-world” environments for distant-talking applications. A synergistic system consisting of microphone array and neural network (MANN) is utilized to mitigate environmental interference introduced by reverberation, ambient noise, and channel mismatch between training and testing conditions. The MANN system is evaluated with experiments on continuous distant-talking speech recognition. The results show that the MANN system elevates the word recognition accuracy to a level which is competitive with a retrained speech recognizer and that the neural network compensation performs better than some previously researched techniques
Keywords
acoustic transducer arrays; feedforward neural nets; interference (signal); microphones; multilayer perceptrons; noise; reverberation; speech recognition; ambient noise; channel mismatch; close talking speech; continuous distant-talking speech recognition; environment robustness; environmental interference; experiments; feedforward multilayer perceptron; microphone array; neural network compensation; real world environments; retrained speech recognizer; reverberation; robust distant talking speech recognition; speech recognition technology; synergistic system; testing conditions; training conditions; word recognition accuracy; Interference; Laboratories; Microphone arrays; Neural networks; Noise robustness; Paper technology; Reverberation; Speech recognition; System testing; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.540280
Filename
540280
Link To Document