Title :
Automated Construction Chinese Domain Ontology from Wikipedia
Author :
Lian, Li ; Ma, Jun ; Lei, JingSheng ; Song, Ling ; Liu, LeBo
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan
Abstract :
Wikipedia (Wiki) is a collaborative online encyclopedia in which Web users share their knowledge about a given topic. Making use of the rich knowledge in the Wiki remains a major challenge. In this paper we propose a method for automatically constructing a domain ontology from the Chinese Wiki. The method is based on entry segmentation and feature text (FT) extraction. First, we segment the names of entries and establish the concept hierarchy. Second, we extract the FTs from the descriptions of entries to eliminate redundant information. Finally, we calculate the similarity between pairs of FTs to revise the concept hierarchy and obtain non-taxonomic relations between concepts. A preliminary experiment indicates that the method is useful for Chinese domain ontology construction.
Keywords :
Web sites; feature extraction; natural language processing; ontologies (artificial intelligence); text analysis; Wikipedia; automated Chinese domain ontology construction; collaborative online encyclopedia; concept hierarchy; entry description; entry segmentation; feature text extraction; nontaxonomy relation; Computer architecture; Computer science; Data mining; Educational institutions; Information science; Ontologies; Service oriented architecture; Taxonomy; Feature Text (FT) extracting; entry segmenting; ontology construction
Conference_Title :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location :
Jinan
Print_ISBN :
978-0-7695-3304-9
DOI :
10.1109/ICNC.2008.717