Title :
Neuromorphic speech processing for noisy environments
Author :
Neti, Chalapathy
Author_Institution :
IBM Corp., Boca Raton, FL, USA
Date :
27 Jun-2 Jul 1994
Abstract :
Current speech recognition systems perform very poorly in the presence of background noise, particularly at signal-to-noise ratios (SNRs) below 10 dB and under certain noise conditions such as cafeteria noise. In this study we investigate acoustic processing based on cochlear models and neural-like processing as a means of arriving at a noise-robust acoustic representation of speech. However, unlike previous work based on cochlear models that used cochlear filter parameters derived from neurophysiological data, we optimize the cochlear filter shape and thresholds to reduce the noise contribution in the resulting acoustic representations. Results suggest that average SNR improvements on the order of 5-10 dB can be obtained for noise-corrupted signals with SNRs near 0-6 dB under realistic noise such as cafeteria noise. Furthermore, using a neural network to include context and arrive at a lower-dimensional representation can lead to further improvements in SNR.
Keywords :
acoustic noise; acoustic signal processing; filtering theory; neural nets; physiological models; speech recognition; acoustic processing; acoustic representation; auditory model; cafeteria noise; cochlear filter; cochlear models; neural network; neuromorphic speech processing; noisy environments; signal-to-noise ratios; Acoustic noise; Background noise; Filters; Neuromorphics; Noise robustness; Noise shaping; Signal to noise ratio; Speech processing; Speech recognition; Working environment noise
Conference_Title :
Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-1901-X
DOI :
10.1109/ICNN.1994.374982