Title :
Automatic Domain-Ontology Relation Extraction from Semi-structured Texts
Author :
Xiao, Cheng ; Zheng, Dequan ; Yang, Yuhang ; Shao, Guojun
Author_Institution :
MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin, China
Abstract :
This paper presents a new method to acquire domain-ontology relations from semi-structured data sources. First, obtain Web documents according to the co-occurrence of concept instance and attribute value. Further, define formats of relation patterns, and extract pattern instances from Web documents, including pattern clustering and pattern combining in each cluster. Finally, relation pattern instances are applied to gain attribute values of new concept instances in domain-ontology. Experiments are carried out in the field of film, the rate of pattern incorrect-division and pattern leakage are respectively 0.19% and 1.31%, the highest precision of combined relation patterns reaches 85%. Experimental results demonstrate that the method developed in this paper is fairly efficient.
Keywords :
information filtering; ontologies (artificial intelligence); text analysis; Web documents; automatic domain-ontology relation extraction; pattern clustering; pattern combining; pattern leakage; semistructured data sources; semistructured texts; Clustering algorithms; Data analysis; Data mining; Information analysis; Laboratories; Lattices; Natural language processing; Natural languages; Search engines; Terminology; ontology structure; pattern instance; relation extraction; semi-structure;
Conference_Titel :
Asian Language Processing, 2009. IALP '09. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-0-7695-3904-1
DOI :
10.1109/IALP.2009.51