• DocumentCode
    2669287
  • Title

    Research on Some Key Technologies of Tibetan Automatic Word Segmentation

  • Author

    Sun, Yuan ; Yan, Xiaodong ; Zhao, Xiaobing ; Yang, Guosheng

  • Author_Institution
    Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
  • fYear
    2011
  • fDate
    1-3 Nov. 2011
  • Firstpage
    188
  • Lastpage
    191
  • Abstract
    This paper researches on some key technologies of Tibetan automatic word segmentation. We propose a Tibetan automatic word segmentation approach, which is taking the advantage of case-auxiliary words and continuous feature. Meanwhile, a resolution method of overlapping ambiguity in Tibetan word segmentation is proposed, which is based on forward-backward scanning identification method and improved maximum probability algorithm. Finally, an experiment is conducted, and the results prove the algorithm is effective.
  • Keywords
    natural language processing; probability; word processing; Tibetan automatic word segmentation; case-auxiliary word; forward-backward scanning identification method; maximum probability algorithm; word segmentation ambiguity; Accuracy; Dictionaries; Educational institutions; Feature extraction; Grammar; Information processing; Text processing; Tibetan word segmentation; case-auxiliary words; continuous features; overlapping ambiguity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Networks and Intelligent Systems (ICINIS), 2011 4th International Conference on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4577-1626-3
  • Type

    conf

  • DOI
    10.1109/ICINIS.2011.43
  • Filename
    6104725