• DocumentCode
    1749720
  • Title

    Kanji-to-hiragana conversion based on a language model

  • Author

    Chang, Wei-Bin

  • Author_Institution
    Philips Res. East Asia - Taipei, Taiwan
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    585
  • Abstract
    In speech recognition systems, a common problem is transcription of new additions to the recognition lexicon into their phonetic symbols. Specific to the Japanese language, such a problem can be dealt with in two steps. We focus on the first step, in which the new lexical entry is converted into a set of hiragana syllabaries, which is almost a phonetic transcription. We propose a conversion scheme which yields the most likely hiragana syllabaries, based on a language model. Results from our evaluations on three test sets are also reported. Although the study is conducted on Japanese only, our approach has applications to Chinese
  • Keywords
    natural languages; probability; speech recognition; statistical analysis; Chinese; Japanese language; conversion scheme; kanji-to-hiragana conversion; language model; phonetic symbols; phonetic transcription; recognition lexicon; speech recognition systems; Asia; Automatic speech recognition; Dictionaries; Natural languages; Speech recognition; Testing; White spaces; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940899
  • Filename
    940899