• DocumentCode
    3166752
  • Title

    Discriminative approach to lexical entry selection for automatic speech recognition of agglutinative language

  • Author

    Ablimit, Mijit ; Kawahara, Tatsuya ; Hamdulla, Askar

  • Author_Institution
    Sch. of Inf., Kyoto Univ., Kyoto, Japan
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    5009
  • Lastpage
    5012
  • Abstract
    In agglutinative languages, selection of lexical unit is not obvious. Morpheme unit is usually adopted to ensure the sufficient coverage, but many morphemes are short, resulting in weak constraints and possible confusions. In this paper, we propose a discriminative approach to select lexical entries which will directly contribute to ASR error reduction. We define an evaluation function for each word by a set of features and their weights, and the measure for optimization by the difference of WERs by the morpheme-based model and by the word-based model. Then, the weights of the features are learned by a perceptron algorithm. Finally, word (or sub-word) entries with higher evaluation scores are selected to be added to the lexicon. This method is successfully applied to an Uyghur large-vocabulary continuous speech recognition system, resulting in a significant reduction of WER and the lexicon size. Further improvement is achieved by combining with a statistical method based on mutual information criterion.
  • Keywords
    optimisation; speech recognition; statistical analysis; ASR error reduction; Uyghur large-vocabulary continuous speech recognition system; WER; agglutinative language; automatic speech recognition; discriminative approach; lexical entry selection; lexicon; morpheme unit; morpheme-based model; optimization; statistical method; word-based model; Computational modeling; Data models; Hidden Markov models; Mutual information; Optimization; Speech recognition; Training; Uyghur; discriminative learning; language model; morpheme; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6289045
  • Filename
    6289045