Title :
Multi-thread Multi-keywords Matching Approach for Uyghur Text
Author :
Xinyuan Zhao ; Adili
Author_Institution :
Lab. for Network Inf. Security & Public Opinion Anal., Xinjiang Normal Univ., Urumqi, China
Abstract :
Keyword matching is a preliminary step in public opinion analysis. Uyghur is an agglutinative language in which suffixes attach to words to express different semantic or syntactic functions. Traditional matching algorithms therefore cannot be applied directly to Uyghur text, because Uyghur words take different surface forms in text. In this paper, we implement an automaton-based multi-keyword matching algorithm for Uyghur text. The algorithm handles inflectional suffixes and vowel weakening within words by means of reverse suffix automata and vowel-weakening restoration automata. By partitioning the keyword automata according to the first letter of each keyword, a general multi-thread keyword matching approach for Uyghur is also proposed.
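The idea of grouping keywords by first letter and matching each group in its own thread can be illustrated with a minimal Python sketch. It is not the authors' implementation: plain set lookups stand in for the automata, and the suffix list, the vowel restoration rule (a stem-final "i" is tried as "a" or "e"), the Latin-script example words, and all function names are assumptions made for illustration only.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical suffix list in Latin transliteration (assumption, not from the paper).
# The empty suffix lets a bare stem match as well.
SUFFIXES = ("", "lar", "ler", "ni", "ning")


def candidate_stems(token):
    """Yield possible stems for a token: strip a suffix, then try vowel restoration."""
    for suffix in SUFFIXES:
        if not token.endswith(suffix):
            continue
        stem = token[:len(token) - len(suffix)]
        if not stem:
            continue
        yield stem
        # Toy restoration rule (assumption): a weakened final "i" may have been "a" or "e".
        if suffix and len(stem) > 1 and stem.endswith("i"):
            yield stem[:-1] + "a"
            yield stem[:-1] + "e"


def partition_by_first_letter(keywords):
    """Group keywords by first letter so each group can be matched independently."""
    groups = defaultdict(set)
    for keyword in keywords:
        if keyword:
            groups[keyword[0]].add(keyword)
    return groups


def match_partition(letter, stems, tokens):
    """Match one keyword group against the tokens that start with its letter."""
    hits = []
    for position, token in enumerate(tokens):
        if not token.startswith(letter):
            continue
        for stem in candidate_stems(token):
            if stem in stems:
                hits.append((position, stem))
                break
    return hits


def match_keywords(keywords, text):
    """Return sorted (token_position, keyword) pairs, one worker task per first letter."""
    tokens = text.split()
    groups = partition_by_first_letter(keywords)
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(match_partition, letter, stems, tokens)
                   for letter, stems in groups.items()]
        hits = [hit for future in futures for hit in future.result()]
    return sorted(hits)
```

For example, `match_keywords(["bala", "kitab"], "balilar kitablar oquydu")` matches the inflected tokens back to the two keywords. Because the groups share no keywords and each worker only reads the token list, no locking is needed in this sketch.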
Keywords :
automata theory; multi-threading; natural languages; pattern matching; text analysis; Uyghur language; Uyghur text; agglutinative language; multithread multikeywords matching approach; public opinion analysis; vowel restoration automata; Automata; Doped fiber amplifiers; Educational institutions; Instruction sets; Joining processes; Signal processing algorithms; matching; vowel
Conference_Title :
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location :
Urumqi
DOI :
10.1109/IALP.2013.36