• DocumentCode
    1364822
  • Title

    Round-Robin Duel Discriminative Language Models

  • Author

    Oba, Takanobul ; Hori, Takaaki ; Nakamura, Atsushi ; Ito, Akinori

  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
  • Volume
    20
  • Issue
    4
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    1244
  • Lastpage
    1255
  • Abstract
    Discriminative training has received a lot of attention from both the machine learning and speech recognition communities. The idea behind the discriminative approach is to construct a model that distinguishes correct samples from incorrect samples, while the conventional generative approach estimates the distributions of correct samples. We propose a novel discriminative training method and apply it to a language model for reranking speech recognition hypotheses. Our proposed method has round-robin duel discrimination (R2D2) criteria in which all the pairs of sentence hypotheses including pairs of incorrect sentences are distinguished from each other, taking their error rate into account. Since the objective function is convex, the global optimum can be found through a normal parameter estimation method such as the quasi-Newton method. Furthermore, the proposed method is an expansion of the global conditional log-linear model whose objective function corresponds to the conditional random fields. Our experimental results show that R2D2 outperforms conventional methods in many situations, including different languages, different feature constructions and different difficulties.
  • Keywords
    learning (artificial intelligence); natural language processing; parameter estimation; speech recognition; conditional random fields; discriminative training; error rate; feature constructions; global conditional log-linear model; machine learning; normal parameter estimation method; objective function; quasiNewton method; round-robin duel discriminative language models; sentence hypotheses; speech recognition communities; Data models; Error analysis; Speech; Speech processing; Speech recognition; Training; Vectors; Discriminative language model; error correction; round-robin duel discrimination (R2D2);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2174225
  • Filename
    6064876