DocumentCode :
3300216
Title :
Automated Construction Chinese Domain Ontology from Wikipedia
Author :
Lian, Li ; Ma, Jun ; Lei, JingSheng ; Song, Ling ; Liu, LeBo
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan
Volume :
2
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
670
Lastpage :
674
Abstract :
Wikipedia (Wiki) is a collaborative online encyclopedia in which Web users share their knowledge about a given topic. Making use of the rich knowledge in the Wiki is a major challenge. In this paper we propose a method for automatically constructing a domain ontology from the Chinese Wiki. The method is based on entry segmentation and feature text (FT) extraction. First, we segment the names of entries and establish the concept hierarchy. Second, we extract the FTs from the descriptions of entries to eliminate redundant information. Finally, we calculate the similarity between pairs of FTs to revise the concept hierarchy and obtain non-taxonomic relations between concepts. A preliminary experiment indicates that the method is useful for Chinese domain ontology construction.
Keywords :
Web sites; feature extraction; natural language processing; ontologies (artificial intelligence); text analysis; Wikipedia; automated Chinese domain ontology construction; collaborative online encyclopedia; concept hierarchy; entry description; entry segmentation; feature text extraction; nontaxonomy relation; Computer architecture; Computer science; Data mining; Educational institutions; Feature extraction; Information science; Ontologies; Service oriented architecture; Taxonomy; Wikipedia; Feature Text (FT) extracting; Wikipedia; entry segmenting; ontology construction;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location :
Jinan
Print_ISBN :
978-0-7695-3304-9
Type :
conf
DOI :
10.1109/ICNC.2008.717
Filename :
4667078