DocumentCode
3410935
Title
Robust speaker identification using auditory features and computational auditory scene analysis
Author
Shao, Yang ; Wang, DeLiang
Author_Institution
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
1589
Lastpage
1592
Abstract
The performance of speaker recognition systems drop significantly under noisy conditions. To improve robustness, we have recently proposed novel auditory features and a robust speaker recognition system using a front-end based on computational auditory scene analysis. In this paper, we further study the auditory features by exploring different feature dimensions and incorporating dynamic features. In addition, we evaluate the features and robust recognition in a speaker identification task in a number of noisy conditions. We find that one of the auditory features performs substantially better than a conventional speaker feature. Furthermore, our recognition system achieves significant performance improvements compared with an advanced front-end in a wide range of signal-to-noise conditions.
Keywords
feature extraction; speaker recognition; auditory features; computational auditory scene analysis; signal-to-noise condition; speaker identification; speaker recognition; Cepstral analysis; Crosstalk; Feature extraction; Humans; Image analysis; Mel frequency cepstral coefficient; Noise robustness; Signal to noise ratio; Speaker recognition; Speech analysis; Gammatone feature; Gammatone frequency cepstral coefficient; Robust speaker recognition; auditory feature; computational auditory scene analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4517928
Filename
4517928
Link To Document