DocumentCode
1565771
Title
Modeling Human Auditory Perception for Noise-Robust Speech Recognition
Author
Soo-Young Lee
Author_Institution
Dept. of BioSystems, Korea Adv. Inst. of Sci. & Technol., Daejeon
Volume
3
fYear
2005
Abstract
Several bio-inspired models of human auditory perception are reported for robust speech recognition in real-world noisy environments. The developed mathematical models of the human auditory pathway are integrated into a speech recognition system with three components: (1) a nonlinear feature extraction model from the cochlea to the auditory cortex, (2) a binaural processing model at the superior olivary complex, and (3) a top-down attention model from the higher brain to the cochlea. Unsupervised independent component analysis (ICA) shows that some auditory feature extraction and binaural processing mechanisms follow information theory with sparse representation. The ICA-based features resemble the frequency-limited features extracted at the cochlea as well as the more complex time-frequency features from the inferior colliculus and auditory cortex. The top-down attention model shows how pre-acquired knowledge in the brain filters out irrelevant features or fills in missing features in the sensory data. Top-down attention and bottom-up binaural processing are combined into a single system for highly noisy cases. The auditory model requires extensive computation, and several VLSI implementations have been developed for real-time applications. Experimental results demonstrate much better recognition performance in real-world noisy environments.
Keywords
feature extraction; hearing; independent component analysis; speech recognition; auditory cortex; binaural processing model; cochlea; human auditory pathway; human auditory perception; noise-robust speech recognition; nonlinear feature extraction model; superior olivary complex; top-down attention model; unsupervised independent component analysis; Brain modeling; Feature extraction; Frequency; Humans; Independent component analysis; Information theory; Mathematical model; Noise robustness; Speech recognition; Working environment noise
fLanguage
English
Publisher
IEEE
Conference_Titel
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location
Beijing
Print_ISBN
0-7803-9422-4
Type
conf
DOI
10.1109/ICNNB.2005.1614867
Filename
1614867