DocumentCode
3268528
Title
2D psychoacoustic filtering for robust speech recognition
Author
Dai, Peng ; Soon, Ing Yann ; Yeo, Chai Kiat
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2009
fDate
8-10 Dec. 2009
Firstpage
1
Lastpage
5
Abstract
One of the weaknesses of speech recognition system is its lack of robustness to background noise as compared to human listeners under similarly conditions. This paper proposes a 2D psychoacoustic modeling algorithm which is integrated with a feature extraction front-end for hidden Markov model (HMM). The proposed algorithm incorporates the properties of human auditory system and applies it to the speech recognition system to enhance its robustness. It integrates forward masking, lateral inhibition and cepstral mean normalization into ordinary mel-frequency cepstral coefficients (MFCC) feature extraction algorithm. Experiments carried out on AURORA2 database show that the word recognition rate can be improved significantly at low computational cost.
Keywords
acoustic filters; cepstral analysis; feature extraction; filtering theory; hidden Markov models; speech intelligibility; speech recognition; 2D psychoacoustic filtering; background noise; cepstral mean normalization; feature extraction; forward masking; hidden Markov model; human auditory system; lateral inhibition; mel-frequency cepstral coefficient; robust speech recognition; word recognition rate; Background noise; Cepstral analysis; Feature extraction; Filtering; Hidden Markov models; Humans; Mel frequency cepstral coefficient; Noise robustness; Psychology; Speech recognition; 2D Mask; Automatic Speech Recognition; Simultaneous Masking; Temporal Masking;
fLanguage
English
Publisher
ieee
Conference_Titel
Information, Communications and Signal Processing, 2009. ICICS 2009. 7th International Conference on
Conference_Location
Macau
Print_ISBN
978-1-4244-4656-8
Electronic_ISBN
978-1-4244-4657-5
Type
conf
DOI
10.1109/ICICS.2009.5397502
Filename
5397502
Link To Document