Title :
Automatic Construction of Domain Concept Hierarchy
Author :
Qiao, Sun ; Chunhui, Zhang ; Zhibo, Chen
Author_Institution :
Sch. of Comput. Sci. & Eng., Beihang Univ., Beijing, China
Abstract :
This paper presents a general procedure for automatically constructing a domain concept hierarchy. The procedure is domain-independent and constructs the hierarchy from a domain corpus. It consists mainly of domain terminology extraction, word sense disambiguation, similarity computation, hierarchy construction, and subsumption relation detection. All extracted candidate terms are first ranked by the frequency ratio and entropy of each word, and the top-ranked terms are then selected as domain terminology. For words in WordNet, the relations between terms are taken into account, while distributional similarity is used to compute similarity between words outside WordNet. Experiments on two domain corpora show that the proposed procedure is feasible and produces a reasonable concept hierarchy.
Keywords :
Internet; entropy; information retrieval; text analysis; word processing; WordNet; automatic domain concept hierarchy construction; domain corpus; domain terminology extraction; frequency ratio; similarity computation; subsumption relation detection; word entropy; word sense disambiguation; Entropy; Feature extraction; Frequency domain analysis; Learning; Terminology; construction; domain concept; extraction; hierarchy;
Conference_Title :
Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2010 International Conference on
Conference_Location :
Huangshan
Print_ISBN :
978-1-4244-8434-8
Electronic_ISBN :
978-0-7695-4235-5
DOI :
10.1109/CyberC.2010.85