• DocumentCode
    3422280
  • Title

    Boosted MMI for model and feature-space discriminative training

  • Author

    Povey, Daniel ; Kanevsky, Dimitri ; Kingsbury, Brian ; Ramabhadran, Bhuvana ; Saon, George ; Visweswariah, Karthik

  • Author_Institution
    TJ. Watson Res. Center, IBM, Yorktown Heights, NY
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4057
  • Lastpage
    4060
  • Abstract
    We present a modified form of the maximum mutual information (MMI) objective function which gives improved results for discriminative training. The modification consists of boosting the likelihoods of paths in the denominator lattice that have a higher phone error relative to the correct transcript, by using the same phone accuracy function that is used in Minimum Phone Error (MPE) training. We combine this with another improvement to our implementation of the Extended Baum-Welch update equations for MMI, namely the canceling of any shared part of the numerator and denominator statistics on each frame (a procedure that is already done in MPE). This change affects the Gaussian-specific learning rate. We also investigate another modification whereby we replace I-smoothing to the ML estimate with I-smoothing to the previous iteration´s value. Boosted MMI gives better results than MPE in both model and feature-space discriminative training, although not consistently.
  • Keywords
    Gaussian processes; feature extraction; speech recognition; Baum-Welch update equations; Gaussian-specific learning rate; denominator lattice; feature space discriminative training; maximum mutual information; minimum phone error; objective function; phone accuracy function; Boosting; Equations; Error correction; Gaussian processes; Hidden Markov models; Lattices; Maximum likelihood estimation; Mutual information; Speech recognition; Statistics; Discriminative Training; MMI; MPE; Maximum Margin; Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518545
  • Filename
    4518545