DocumentCode :
3268528
Title :
2D psychoacoustic filtering for robust speech recognition
Author :
Dai, Peng ; Soon, Ing Yann ; Yeo, Chai Kiat
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2009
fDate :
8-10 Dec. 2009
Firstpage :
1
Lastpage :
5
Abstract :
One of the weaknesses of speech recognition system is its lack of robustness to background noise as compared to human listeners under similarly conditions. This paper proposes a 2D psychoacoustic modeling algorithm which is integrated with a feature extraction front-end for hidden Markov model (HMM). The proposed algorithm incorporates the properties of human auditory system and applies it to the speech recognition system to enhance its robustness. It integrates forward masking, lateral inhibition and cepstral mean normalization into ordinary mel-frequency cepstral coefficients (MFCC) feature extraction algorithm. Experiments carried out on AURORA2 database show that the word recognition rate can be improved significantly at low computational cost.
Keywords :
acoustic filters; cepstral analysis; feature extraction; filtering theory; hidden Markov models; speech intelligibility; speech recognition; 2D psychoacoustic filtering; background noise; cepstral mean normalization; feature extraction; forward masking; hidden Markov model; human auditory system; lateral inhibition; mel-frequency cepstral coefficient; robust speech recognition; word recognition rate; Background noise; Cepstral analysis; Feature extraction; Filtering; Hidden Markov models; Humans; Mel frequency cepstral coefficient; Noise robustness; Psychology; Speech recognition; 2D Mask; Automatic Speech Recognition; Simultaneous Masking; Temporal Masking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information, Communications and Signal Processing, 2009. ICICS 2009. 7th International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4244-4656-8
Electronic_ISBN :
978-1-4244-4657-5
Type :
conf
DOI :
10.1109/ICICS.2009.5397502
Filename :
5397502
Link To Document :
بازگشت