DocumentCode
2627898
Title
Korean Part-of-Speech Tagging Using Disambiguation Rules for Ambiguous Word and Statistical information
Author
Ahn, Young-Min ; Seo, Young-Hoon
Author_Institution
Chungbuk Nat. Univ., Cheongju
fYear
2007
fDate
21-23 Nov. 2007
Firstpage
1598
Lastpage
1601
Abstract
In this paper we describe a Korean part-of-speech tagging approach using disambiguation rules for ambiguous word and statistical information. Our tagging approach resolves lexical ambiguities by common rules, rules for individual ambiguous word, and statistical approach. Common rules are ones for idioms and phrases of common use including phrases composed of main and auxiliary verbs. We built disambiguation rules for each word which has several distinct morphological analysis results to enhance tagging accuracy. Each rule may have morphemes, morphological tags, and/or word senses of not only an ambiguous word itself but also words around it. Statistical approach based on HMM is then applied for ambiguous words which are not resolved by rules. Experiment shows that the part-of-speech tagging approach has high accuracy and broad coverage.
Keywords
grammars; hidden Markov models; natural language processing; statistical analysis; HMM; Korean part-of-speech tagging; ambiguous word; disambiguation rule; morphological analysis; statistical information; Data mining; Error correction; Hidden Markov models; Information analysis; Information technology; Natural language processing; Natural languages; Probability; Solid modeling; Tagging;
fLanguage
English
Publisher
ieee
Conference_Titel
Convergence Information Technology, 2007. International Conference on
Conference_Location
Gyeongju
Print_ISBN
0-7695-3038-9
Type
conf
DOI
10.1109/ICCIT.2007.380
Filename
4420481
Link To Document