• DocumentCode
    2065329
  • Title

    Discriminative Output Coding Features for Speech Recognition

  • Author

    Dehzangi, Omid ; Ma, Bin ; Chng, Eng Siong ; Li, Haizhou

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents a novel approach of discriminative acoustic feature extraction for speech recognition using output coding technique. A high dimensional feature space for higher discriminative capability is constructed by expanding MFCC coefficients with polynomial expansion. In order to fit the discriminative features in the hidden Markov model structure of speech recognition, the high dimensional feature vectors are further projected into a low dimensional feature space using the output scores of a set of SVMs. Each of the SVMs is trained in one phone versus the rest manner so that each of the resulting feature dimensions can provide effective information to differ one phone from the others. The discriminative features have been evaluated in the speech recognition task of the TIMIT corpus, and 72.18% phone accuracy has been achieved.
  • Keywords
    feature extraction; hidden Markov models; speech coding; speech recognition; support vector machines; discriminative acoustic feature extraction; discriminative output coding features; hidden Markov model structure; polynomial expansion; speech recognition; support vector machine; Acoustical engineering; Automatic speech recognition; Feature extraction; Hidden Markov models; Linear discriminant analysis; Maximum likelihood estimation; Mel frequency cepstral coefficient; Polynomials; Speech recognition; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2942-4
  • Electronic_ISBN
    978-1-4244-2943-1
  • Type

    conf

  • DOI
    10.1109/CHINSL.2008.ECP.34
  • Filename
    4730288