Modeling Human Auditory Perception for Noise-Robust Speech Recognition

Author

Soo-Young Lee

Author_Institution

Dept. of BioSystems, Korea Adv. Inst. of Sci. & Technol., Daejeon

Volume

3

fYear

2005

Abstract

Several bio-inspired models of human auditory perception are reported for robust speech recognition in real-world noisy environment. The developed mathematical models of the human auditory pathway are integrated into a speech recognition system, of which 3 components are (1) the nonlinear feature extraction model from cochlea to auditory cortex, (2) the binaural processing model at superior olivery complex, and (3) the top-down attention model from higher brain to the cochlea. The unsupervised independent component analysis shows that some auditory feature extraction and binaural processing mechanisms follow information theory with sparse representation. The ICA-based features resemble frequency-limited features extracted from the cochlea and also more complex time-frequency features from the inferior colliculus and auditory cortex. The top-down attention model shows how the pre-acquired knowledge in our brain filters out irrelevant features or fills in missing features in the sensory data. Both the top-down attention and bottom-up binaural processing are combined into a single system for high-noisy cases. This auditory model requires extensive computing, and several VLSI implementations had been developed for real-time applications. Experimental results demonstrate much better recognition performance in realworld noisy environments

Keywords

feature extraction; hearing; independent component analysis; speech recognition; auditory cortex; binaural processing model; cochlea; human auditory pathway; human auditory perception; noise-robust speech recognition; nonlinear feature extraction model; superior olivery complex; top-down attention model; unsupervised independent component analysis; Brain modeling; Feature extraction; Frequency; Humans; Independent component analysis; Information theory; Mathematical model; Noise robustness; Speech recognition; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks and Brain, 2005. ICNN&B '05. International Conference on

Conference_Location

Beijing

Print_ISBN

0-7803-9422-4

Type

conf

DOI

10.1109/ICNNB.2005.1614867

Filename

1614867