Dimensionality reduction for text-independent speaker identification using Gaussian mixture model

Author

El-Gamal, M.A. ; Abu El-Yazeed, M.F. ; El Ayadi, M.M.H.

Author_Institution

Dept. of Eng. Phys. & Math., Cairo Univ., Giza

Volume

2

fYear

2003

fDate

30-30 Dec. 2003

Firstpage

625

Abstract

Reducing the dimensionality of the training and testing data is crucial for text-independent speaker identification tasks. In this paper, the performance of various dimensionality reduction techniques is evaluated for speaker identification systems using Gaussian mixture model (GMM) as the statistical classifier. An enhancement of the standard linear discriminant analysis (LDA) is proposed in which class distributions are assumed to follow Gaussian mixture distribution. This assumption is more appropriate for asymmetric and multimodal class conditional densities. In addition, a new feature selection technique based on the QR factorization method is introduced. Computer simulation results reveal that the proposed modification to the LDA outperforms the standard algorithm in terms of classification accuracy. Moreover, the QR-based selection technique produces comparable results to other prominent dimensionality reduction techniques

Keywords

Gaussian distribution; feature extraction; speaker recognition; Gaussian mixture model; QR factorization method; computer simulation; dimensionality reduction; feature selection; linear discriminant analysis; multimodal class conditional density; statistical classifier; text-independent speaker identification; Computer simulation; Covariance matrix; Feature extraction; Linear discriminant analysis; Loudspeakers; Physics; Principal component analysis; Speech; Testing; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Circuits and Systems, 2003 IEEE 46th Midwest Symposium on

Conference_Location

Cairo

ISSN

1548-3746

Print_ISBN

0-7803-8294-3

Type

conf

DOI

10.1109/MWSCAS.2003.1562364

Filename

1562364