Title :
Research of Modern Uyghur Word Frequency Statistical Technology
Author :
Azragul ; Nian Mei ; Yasen Yimin
Author_Institution :
Anal. Lab., Xinjiang Normal Univ., Urumqi, China
Abstract :
With the development of our society, the languages are also constantly evolving. Word is the smallest meaningful language composition which able to activity independently, and is also important carrier of knowledge and the basic operation unit in the natural language processing system. Uyghur word frequency statistics technology is the process by computer automatic identification term boundary in the texts. It is the most important pretreatment of information processing technology. However, there is no a really mature Uighur word frequency statistics system, which became one of the bottlenecks that hampered the development of information processing in Uighur language seriously at present. This paper discusses the idea and algorithms of the Uyghur word frequency statistics system in detail. Secondly introduces functional design process of the word frequency statistics system. Third I describe methods and techniques of this system. Finally it states statement of the test results.
Keywords :
natural language processing; statistical analysis; Uighur language; Uyghur word frequency statistical technology; computer automatic identification term boundary; functional design process; information processing technology; language composition; natural language processing system; Algorithm design and analysis; Databases; Dictionaries; Frequency conversion; Standards; Time-frequency analysis; Functional design; Implementation method; Modern Uygur language; Word frequency statistics;
Conference_Titel :
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location :
Urumqi
DOI :
10.1109/IALP.2013.20