DocumentCode :
3326246
Title :
Speaker recognition based on transformed line spectral frequencies
Author :
Lee, Bong Jin ; Kim, Samuel ; Kang, Hong-Goo
Author_Institution :
MCSP, Yonsei Univ., Seoul, South Korea
fYear :
2004
fDate :
18-19 Nov. 2004
Firstpage :
177
Lastpage :
180
Abstract :
Line spectral frequencies (LSF) and five types of transformed LSF are studied for robust text-independent speaker identification. Transformations are constructed by considering physical aspects of the vocal tract. These aspects are: location of formants/s; bandwidth of formants/s; bandwidth and location of formants; bandwidth and location of s; interval of adjacent formant and locations. Identification tests using the TIMIT database verify that all features are useful for speaker recognition; the bandwidth and location of formants, especially, show the best performance. Simulation results also show that LSF and some of the transformed LSF give better performance than Mel-frequency cepstral coefficient (MFCC).
Keywords :
Gaussian processes; covariance matrices; speaker recognition; spectral analysis; Gaussian mixture model; Mel-frequency cepstral coefficient; TIMIT database; bandwidth; diagonal covariance matrices; formant bandwidth; formant location; location; speaker recognition; text-independent speaker identification; transformed line spectral frequencies; vocal tract physical aspects; Bandwidth; Cepstral analysis; Equations; Linear predictive coding; Mel frequency cepstral coefficient; Polynomials; Robustness; Spatial databases; Speaker recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004. Proceedings of 2004 International Symposium on
Print_ISBN :
0-7803-8639-6
Type :
conf
DOI :
10.1109/ISPACS.2004.1439040
Filename :
1439040
Link To Document :
بازگشت