• DocumentCode
    2972133
  • Title

    Large-margin feature adaptation for automatic speech recognition

  • Author

    Cheng, Chih-Chieh ; Sha, Fei ; Saul, Lawrence K.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of California, San Diego, CA, USA
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    87
  • Lastpage
    92
  • Abstract
    We consider how to optimize the acoustic features used by hidden Markov models (HMMs) for automatic speech recognition (ASR). We investigate a mistake-driven algorithm that discriminatively reweights the acoustic features in order to separate the log-likelihoods of correct and incorrect transcriptions by a large margin. The algorithm simultaneously optimizes the HMM parameters in the back end by adapting them to the reweighted features computed by the front end. Using an online approach, we incrementally update feature weights and model parameters after the decoding of each training utterance. To mitigate the strongly biased gradients from individual training utterances, we train several different recognizers in parallel while tying the feature transformations in their front ends. We show that this parameter-tying across different recognizers leads to more stable updates and generally fewer recognition errors.
  • Keywords
    decoding; hidden Markov models; speech coding; speech recognition; automatic speech recognition; hidden Markov models; individual training utterances; large-margin feature adaptation; mistake-driven algorithm; training utterance decoding; update feature weights; Acoustical engineering; Automatic speech recognition; Cepstral analysis; Computer science; Feature extraction; Hidden Markov models; Linear discriminant analysis; Maximum likelihood decoding; Pattern recognition; Signal processing algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5373320
  • Filename
    5373320