• DocumentCode
    2163064
  • Title

    PAC-Bayesian approach for minimization of phoneme error rate

  • Author

    Keshet, Joseph ; McAllester, David ; Hazan, Tamir

  • Author_Institution
    Toyota Technol. Inst. at Chicago, Chicago, IL, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    2224
  • Lastpage
    2227
  • Abstract
    We describe a new approach for phoneme recognition which aims at minimizing the phoneme error rate. Building on structured prediction techniques, we formulate the phoneme recognizer as a linear combination of feature functions. We state a PAC-Bayesian generalization bound, which gives an upper-bound on the expected phoneme error rate in terms of the empirical phoneme error rate. Our algorithm is derived by finding the gradient of the PAC-Bayesian bound and minimizing it by stochastic gradient descent. The resulting algorithm is iterative and easy to implement. Experiments on the TIMIT corpus show that our method achieves the lowest phoneme error rate compared to other discriminative and generative models with the same expressive power.
  • Keywords
    Bayes methods; gradient methods; speech recognition; stochastic processes; PAC-Bayesian generalization bound; TIMIT corpus; feature function; iterative algorithm; phoneme error rate minimization; phoneme recognition; phoneme recognizer; stochastic gradient descent; structured prediction technique; Acoustics; Error analysis; Hidden Markov models; Kernel; Speech; Speech recognition; Training; PAC-Bayesian theorem; discriminative training; kernels; phoneme recognition; structured prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946923
  • Filename
    5946923