مرکز منطقه ای اطلاع رساني علوم و فناوري - Discriminative Output Coding Features for Speech Recognition

DocumentCode :

2065329

Title :

Discriminative Output Coding Features for Speech Recognition

Author :

Dehzangi, Omid ; Ma, Bin ; Chng, Eng Siong ; Li, Haizhou

Author_Institution :

Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore

fYear :

2008

fDate :

16-19 Dec. 2008

Firstpage :

Lastpage :

Abstract :

This paper presents a novel approach of discriminative acoustic feature extraction for speech recognition using output coding technique. A high dimensional feature space for higher discriminative capability is constructed by expanding MFCC coefficients with polynomial expansion. In order to fit the discriminative features in the hidden Markov model structure of speech recognition, the high dimensional feature vectors are further projected into a low dimensional feature space using the output scores of a set of SVMs. Each of the SVMs is trained in one phone versus the rest manner so that each of the resulting feature dimensions can provide effective information to differ one phone from the others. The discriminative features have been evaluated in the speech recognition task of the TIMIT corpus, and 72.18% phone accuracy has been achieved.

Keywords :

feature extraction; hidden Markov models; speech coding; speech recognition; support vector machines; discriminative acoustic feature extraction; discriminative output coding features; hidden Markov model structure; polynomial expansion; speech recognition; support vector machine; Acoustical engineering; Automatic speech recognition; Feature extraction; Hidden Markov models; Linear discriminant analysis; Maximum likelihood estimation; Mel frequency cepstral coefficient; Polynomials; Speech recognition; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on

Conference_Location :

Kunming

Print_ISBN :

978-1-4244-2942-4

Electronic_ISBN :

978-1-4244-2943-1

Type :

conf

DOI :

10.1109/CHINSL.2008.ECP.34

Filename :

4730288

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2065329