DocumentCode :
3239903
Title :
A Comparative Study of Linear and Nonlinear Dimensionality Reduction for Speaker Identification
Author :
Errity, Andrew ; McKenna, John
Author_Institution :
Dublin City Univ., Dublin
fYear :
2007
fDate :
1-4 July 2007
Firstpage :
587
Lastpage :
590
Abstract :
In this paper we apply linear and nonlinear dimensionality reduction methods to speech produced by a number of different speakers in an effort to yield low dimensional features capable of discriminating between speakers. The classical linear dimensionality reduction method, principal component analysis (PCA), and the nonlinear manifold learning method, Isomap, are investigated. The resulting features are evaluated in GMM-based speaker identification experiments and compared to conventional cepstral features. Isomap is shown to give the highest accuracy for very low dimensions, outperforming MFCCs and PCA transformed features. Isomap is shown to be useful for visualisation of speaker clusters. For higher dimensions, speaker identification results indicate that features resulting from PCA offer improvements over conventional MFCCs.
Keywords :
Gaussian processes; principal component analysis; speaker recognition; Gaussian mixture model; Isomap; linear dimensionality reduction method; nonlinear manifold learning method; principal component analysis; speaker identification; Cepstral analysis; Eigenvalues and eigenfunctions; Geophysics computing; Independent component analysis; Learning systems; Linear discriminant analysis; Mel frequency cepstral coefficient; Principal component analysis; Space technology; Speech processing; GMM; Isomap; PCA; dimensionality reduction; speaker identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Signal Processing, 2007 15th International Conference on
Conference_Location :
Cardiff
Print_ISBN :
1-4244-0882-2
Electronic_ISBN :
1-4244-0882-2
Type :
conf
DOI :
10.1109/ICDSP.2007.4288650
Filename :
4288650
Link To Document :
بازگشت