DocumentCode
3239903
Title
A Comparative Study of Linear and Nonlinear Dimensionality Reduction for Speaker Identification
Author
Errity, Andrew ; McKenna, John
Author_Institution
Dublin City Univ., Dublin
fYear
2007
fDate
1-4 July 2007
Firstpage
587
Lastpage
590
Abstract
In this paper we apply linear and nonlinear dimensionality reduction methods to speech produced by a number of different speakers in an effort to yield low dimensional features capable of discriminating between speakers. The classical linear dimensionality reduction method, principal component analysis (PCA), and the nonlinear manifold learning method, Isomap, are investigated. The resulting features are evaluated in GMM-based speaker identification experiments and compared to conventional cepstral features. Isomap is shown to give the highest accuracy for very low dimensions, outperforming MFCCs and PCA transformed features. Isomap is shown to be useful for visualisation of speaker clusters. For higher dimensions, speaker identification results indicate that features resulting from PCA offer improvements over conventional MFCCs.
Keywords
Gaussian processes; principal component analysis; speaker recognition; Gaussian mixture model; Isomap; linear dimensionality reduction method; nonlinear manifold learning method; principal component analysis; speaker identification; Cepstral analysis; Eigenvalues and eigenfunctions; Geophysics computing; Independent component analysis; Learning systems; Linear discriminant analysis; Mel frequency cepstral coefficient; Principal component analysis; Space technology; Speech processing; GMM; Isomap; PCA; dimensionality reduction; speaker identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Signal Processing, 2007 15th International Conference on
Conference_Location
Cardiff
Print_ISBN
1-4244-0882-2
Electronic_ISBN
1-4244-0882-2
Type
conf
DOI
10.1109/ICDSP.2007.4288650
Filename
4288650
Link To Document