Title :
Compound Noun Analysis for Process of Korean Unregistered Word
Author :
Jin, Guanghe ; Li, Zhanguo ; Qu, Dapeng ; Wang, Xingwei
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Abstract :
This paper proposes a new method of compound noun analysis that combines a decomposition model with unregistered word recognition. The unregistered words covered are loanword nouns, name nouns, and place names. A loanword noun is recognized on the basis that it is formed from syllables that have a low usage frequency among Korean syllables. A name noun is recognized based on the characteristics of Korean names and the repeatability of a certain number of syllables. A place name is recognized by consulting a place name dictionary, based on an analysis of the patterns in which place names appear. In the experiments, compound noun analysis achieves an average accuracy of 98.2%. The accuracies of loanword noun recognition and name noun recognition are 92% and 95%, respectively.
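The loanword criterion in the abstract (syllables with low usage frequency in Korean) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy reference word list, and both thresholds (`rare_below`, `min_rare_ratio`) are assumptions chosen only for illustration, and the abstract does not say how frequencies are estimated or thresholded.

```python
from collections import Counter

HANGUL_FIRST, HANGUL_LAST = "\uac00", "\ud7a3"  # precomposed Hangul syllable range


def hangul_syllables(word):
    """Return the precomposed Hangul syllables in a word, in order."""
    return [ch for ch in word if HANGUL_FIRST <= ch <= HANGUL_LAST]


def syllable_frequencies(words):
    """Relative frequency of each syllable in a reference word list (hypothetical helper)."""
    counts = Counter(s for w in words for s in hangul_syllables(w))
    total = sum(counts.values())
    return {s: n / total for s, n in counts.items()} if total else {}


def looks_like_loanword(word, freqs, rare_below=0.1, min_rare_ratio=0.5):
    """Flag a word when enough of its syllables are rare in the reference list.

    Both thresholds are illustrative assumptions, not values from the paper.
    """
    syllables = hangul_syllables(word)
    if not syllables:
        return False
    rare = sum(1 for s in syllables if freqs.get(s, 0.0) < rare_below)
    return rare / len(syllables) >= min_rare_ratio


# Toy reference list of native/Sino-Korean words (an assumption, far too small for real use).
REFERENCE_WORDS = ["학교", "학생", "교실", "생활", "교육", "학습"]
freqs = syllable_frequencies(REFERENCE_WORDS)

print(looks_like_loanword("컴퓨터", freqs))  # "computer": syllables unseen in the reference list
print(looks_like_loanword("학교", freqs))    # "school": built from frequent syllables
```

A realistic version would estimate syllable frequencies from a large corpus and tune the thresholds on labeled data; the sketch only shows the shape of the frequency test.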
Keywords :
word processing; Korean unregistered word recognition; compound noun analysis; decomposition model; loanword noun; name nouns recognition; Accuracy; Algorithm design and analysis; Analytical models; Character recognition; Compounds; Dictionaries; Educational institutions; Korean; compound noun; loanword; name noun; place name; unregistered word;
Conference_Title :
Computational and Information Sciences (ICCIS), 2012 Fourth International Conference on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4673-2406-9
DOI :
10.1109/ICCIS.2012.109