Title :
Auditory front-ends for noise-robust automatic speech recognition
Author :
Yeh, Ja-Zang ; Chen, Chia-Ping
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
In this paper we investigate a noise-robust feature extraction method, which is based on the auditory masking effect, for automatic speech recognition systems. We physically model the basilar membrane as a cascade system of simple harmonic oscillators, and mathematically analyze the motion of the basilar membrane due to speech signals. Based on the analysis, we can identify a correlational factor for the coupled motion of the oscillators, which can be used to partially explain the masking effect. Accordingly, we insert an auditory module in the speech feature extraction process. The proposed methodology is evaluated on the Aurora 2.0 noisy-digit speech database, and it achieves significant improvements.
Keywords :
cascade systems; feature extraction; speech intelligibility; speech recognition; auditory front end; auditory masking effect; basilar membrane; cascade system; correlational factor; coupled motion; noise robust automatic speech recognition; noise robust feature extraction method; noisy digit speech database; simple harmonic oscillator; speech signal; Biomembranes; Feature extraction; Harmonic analysis; Noise measurement; Oscillators; Speech; Speech recognition; auditory model; frequency masking; noise robustness; speech recognition;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684910