DocumentCode :
2130572
Title :
Acquiring dominant compound terms to build Korean domain knowledge bases
Author :
Jung, Hanmin ; Koo, HeeKwan ; Lee, Byeong-Hee ; Sung, Won-Kyung
Author_Institution :
Korea Inst. of Sci. & Technol. Inf., Taejeon, South Korea
fYear :
2005
fDate :
2005
Firstpage :
2
Lastpage :
7
Abstract :
Compound terms should be well ranked to reduce the labor of building domain knowledge bases such as term dictionaries and thesauri. In particular, terms that have been dominant in recent years are valuable in terms of coverage and reference. We adopt linguistic filtering, using a part-of-speech filter and four combination rules, to extract Korean compound terms. Domain seed terms are then used to obtain related terms from the extracted term list. Term ranking, which considers the dominance trend of terms across several years of data, assigns term dominance values to the related terms. Experimental results show that our ranking scheme distributes extracted terms more adequately than term frequency ordering, reducing the effort of building domain knowledge bases by clustering terms into three groups: growing, declining, and steady.
Keywords :
dictionaries; information filtering; knowledge based systems; linguistics; natural languages; speech processing; thesauri; Korean domain knowledge base; dictionary; domain seed term; dominant compound terms; linguistic filtering; part-of-speech filter; term dominance trend; term dominance value; term extraction; term frequency ordering; term ranking; thesaurus; Buildings; Data mining; Dictionaries; Filtering; Filters; Frequency; Information science; Natural languages; Terminology; Thesauri; Term Dominance Trend; Term Dominance Value; Term Extraction; Term Ranking;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer and Information Science, 2005. Fourth Annual ACIS International Conference on
Print_ISBN :
0-7695-2296-3
Type :
conf
DOI :
10.1109/ICIS.2005.23
Filename :
1515366