Title :
Analog auditory perception model for robust speech recognition
Author :
Deng, Yunbin ; Chakrabartty, Shantanu ; Cauwenberghs, Gert
Author_Institution :
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
An auditory perception model for noise-robust speech feature extraction is presented. The model assumes continuous-time filtering and rectification, amenable to real-time, low-power analog VLSI implementation. A 3 mm×3 mm CMOS chip in 0.5 μm CMOS technology implements the general form of the model with digitally programmable filter parameters. Experiments on the TI-DIGIT database demonstrate consistent robustness of the new features to noise of various statistics, yielding significant improvements in digit recognition accuracy over models identically trained using Mel-scale frequency cepstral coefficient (MFCC) features.
Keywords :
CMOS analogue integrated circuits; VLSI; continuous time filters; digital filters; feature extraction; filtering theory; hearing; low-power electronics; programmable filters; speech recognition; statistics; 0.5 micron; CMOS chip; CMOS technology; Mel-scale frequency cepstral coefficient; TI-DIGIT database; analog auditory perception model; continuous time filtering; continuous time rectification; digit recognition accuracy; digital programmable filter parameters; low power analog VLSI implementation; noise-robust speech feature extraction; robust speech recognition; robustness; statistics; CMOS technology; Digital filters; Feature extraction; Filtering; Mel frequency cepstral coefficient; Noise robustness; Semiconductor device modeling; Spatial databases; Speech recognition; Very large scale integration;
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
Print_ISBN :
0-7803-8359-1
DOI :
10.1109/IJCNN.2004.1380859