DocumentCode :
2065329
Title :
Discriminative Output Coding Features for Speech Recognition
Author :
Dehzangi, Omid ; Ma, Bin ; Chng, Eng Siong ; Li, Haizhou
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2008
fDate :
16-19 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
This paper presents a novel approach of discriminative acoustic feature extraction for speech recognition using output coding technique. A high dimensional feature space for higher discriminative capability is constructed by expanding MFCC coefficients with polynomial expansion. In order to fit the discriminative features in the hidden Markov model structure of speech recognition, the high dimensional feature vectors are further projected into a low dimensional feature space using the output scores of a set of SVMs. Each of the SVMs is trained in one phone versus the rest manner so that each of the resulting feature dimensions can provide effective information to differ one phone from the others. The discriminative features have been evaluated in the speech recognition task of the TIMIT corpus, and 72.18% phone accuracy has been achieved.
Keywords :
feature extraction; hidden Markov models; speech coding; speech recognition; support vector machines; discriminative acoustic feature extraction; discriminative output coding features; hidden Markov model structure; polynomial expansion; speech recognition; support vector machine; Acoustical engineering; Automatic speech recognition; Feature extraction; Hidden Markov models; Linear discriminant analysis; Maximum likelihood estimation; Mel frequency cepstral coefficient; Polynomials; Speech recognition; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
Type :
conf
DOI :
10.1109/CHINSL.2008.ECP.34
Filename :
4730288
Link To Document :
بازگشت