DocumentCode
2669287
Title
Research on Some Key Technologies of Tibetan Automatic Word Segmentation
Author
Sun, Yuan ; Yan, Xiaodong ; Zhao, Xiaobing ; Yang, Guosheng
Author_Institution
Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
fYear
2011
fDate
1-3 Nov. 2011
Firstpage
188
Lastpage
191
Abstract
This paper researches on some key technologies of Tibetan automatic word segmentation. We propose a Tibetan automatic word segmentation approach, which is taking the advantage of case-auxiliary words and continuous feature. Meanwhile, a resolution method of overlapping ambiguity in Tibetan word segmentation is proposed, which is based on forward-backward scanning identification method and improved maximum probability algorithm. Finally, an experiment is conducted, and the results prove the algorithm is effective.
Keywords
natural language processing; probability; word processing; Tibetan automatic word segmentation; case-auxiliary word; forward-backward scanning identification method; maximum probability algorithm; word segmentation ambiguity; Accuracy; Dictionaries; Educational institutions; Feature extraction; Grammar; Information processing; Text processing; Tibetan word segmentation; case-auxiliary words; continuous features; overlapping ambiguity;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Networks and Intelligent Systems (ICINIS), 2011 4th International Conference on
Conference_Location
Kunming
Print_ISBN
978-1-4577-1626-3
Type
conf
DOI
10.1109/ICINIS.2011.43
Filename
6104725
Link To Document