DocumentCode :
352360
Title :
Maximum likelihood discriminant feature spaces
Author :
Saon, George ; Padmanabhan, Mukund ; Gopinath, Ramesh ; Chen, Scott
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
2
fYear :
2000
fDate :
2000
Abstract :
Linear discriminant analysis (LDA) is known to be inappropriate for the case of classes with unequal sample covariances. There has been an interest in generalizing LDA to heteroscedastic discriminant analysis (HDA) by removing the equal within-class covariance constraint. This paper presents a new approach to HDA by defining an objective function which maximizes the class discrimination in the projected subspace while ignoring the rejected dimensions. Moreover, we investigate the link between discrimination and the likelihood of the projected samples and show that HDA can be viewed as a constrained ML projection for a full covariance Gaussian model, the constraint being given by the maximization of the projected between-class scatter volume. It is shown that, under diagonal covariance Gaussian modeling constraints, applying a diagonalizing linear transformation (MLLT) to the HDA space results in increased classification accuracy even though HDA alone actually degrades the recognition performance. Experiments performed on the Switchboard and Voicemail databases show a 10%-13% relative improvement in the word error rate over standard cepstral processing
Keywords :
Gaussian processes; cepstral analysis; maximum likelihood estimation; speech recognition; HDA; class discrimination; constrained ML projection; diagonal covariance Gaussian modeling constraints; diagonalizing linear transformation; equal within-class covariance constraint; full covariance Gaussian model; heteroscedastic discriminant analysis; maximization; maximum likelihood discriminant feature spaces; objective function; projected between-class scatter volume; projected subspace; speech recognition; word error rate; Acoustic scattering; Cepstral analysis; Covariance matrix; Databases; Degradation; Error analysis; Linear discriminant analysis; Performance analysis; Speech recognition; Voice mail;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.859163
Filename :
859163
Link To Document :
بازگشت