• DocumentCode
    178426
  • Title

    Narrow Adaptive Regularization of weights for grapheme-to-phoneme conversion

  • Author

    Kubo, Koichi ; Sakti, Sakriani ; Neubig, Graham ; Toda, Takechi ; Nakamura, Shigenari

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol. (NAIST), Ikoma, Japan
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    2589
  • Lastpage
    2593
  • Abstract
    As the speech recognition field proceeds to open domain and multilingual tasks, the need for robust g2p conversion has been increasing. Towards this objective, we propose a new g2p conversion training method based on the Narrow Adaptive Regularization of Weights (NAROW) online learning algorithm. NAROW improves over its predecessor AROW by automatically adjusting hyperparameters to reduce mistake bounds, and ensuring that the learning rate is not updated when features for the input data have already been updated enough. The contribution of this paper is first to extend NAROW to structured learning, and show the inequality to bound the maximum number of errors in structured NAROW. In experiments, our proposed approach significantly improved over MIRA with consistent phoneme error rate reductions of 1.3-3.8% on a variety of dictionaries.
  • Keywords
    computer aided instruction; speech recognition; MIRA; NAROW; domain task; g2p conversion training method; grapheme-to-phoneme conversion; multilingual task; narrow adaptive regularization of weights online learning algorithm; online discriminative training; out-of-vocabulary word; speech recognition; Context; Convex functions; Error analysis; Joints; Prediction algorithms; Training; Vectors; NAROW; g2p conversion; online discriminative training; out-of-vocabulary word; structured learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854068
  • Filename
    6854068