DocumentCode :
145423
Title :
Ontology Population from the Web: An Inductive Logic Programming-Based Approach
Author :
Lima, Raphaela ; Oliveira, Henrique ; Freitas, Fred ; Espinasse, Bernard
Author_Institution :
Inf. Center, Fed. Univ. of Pernambuco, Recife, Brazil
fYear :
2014
fDate :
7-9 April 2014
Firstpage :
473
Lastpage :
478
Abstract :
The rapid growth of the Web and the information overload problem demand the development of practical information extraction (IE) solutions for web content processing. Ontology Population (OP) concerns both the extraction and classification of instances of the concepts and relations defined by an ontology. Developing IE rules for OP is an intensive and time-consuming process. Thus, an automated mechanism, based on machine-learning techniques, able to convert textual data from web pages into ontology instances may be a crucial path. This paper presents an inductive logic programming-based method that automatic induces symbolic extraction rules, which are used for populating a domain ontology with instances of entity classes. This method uses domain-independent linguistic patterns for retrieving candidate instances from web pages, and a WordNet semantic similarity measure as background knowledge to be used as input by a generic inductive logic programming system. Experiments were conducted concerning both the instance classification problem and a comparison with other popular machine learning algorithms, with encouraging results.
Keywords :
Internet; information retrieval; learning (artificial intelligence); logic programming; ontologies (artificial intelligence); pattern classification; IE solutions; OP; Web content processing; Web pages; WordNet semantic similarity measure; background knowledge; candidate instances retrieval; domain-independent linguistic patterns; entity classes; generic inductive logic programming system; information extraction; information overload problem; instance classification problem; machine-learning techniques; ontology population; symbolic extraction rules; textual data; Feature extraction; Logic programming; Ontologies; Semantics; Sociology; Statistics; Inductive Logic Programming; Information Extraction; Ontology Population; Pattern Learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: New Generations (ITNG), 2014 11th International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4799-3187-3
Type :
conf
DOI :
10.1109/ITNG.2014.60
Filename :
6822242
Link To Document :
بازگشت