Title :
A Methodology to Discover Semantic Features from Textual Resources
Author :
Vicient, Carlos ; Sanchez, Dominick ; Moreno, Antonio
Author_Institution :
Dept. d'Eng. Inf. i Mat., Univ. Rovira i Virgili, Tarragona, Spain
Abstract :
Data analysis algorithms that process textual data rely on extracting relevant features from text and associating them appropriately with their formal semantics. This paper presents a method to assist this task by annotating extracted textual features with concepts from a background ontology. The method is automatic and unsupervised, and it has been designed generically so that it can be applied to textual resources ranging from plain text to semi-structured resources (such as Wikipedia articles). The system has been tested with tourist destinations and Wikipedia articles, showing promising results.
Keywords :
data analysis; feature extraction; ontologies (artificial intelligence); travel industry; Wikipedia articles; background ontology; data analysis algorithms; formal semantic features discovery; textual data processing; textual feature extraction annotation; textual resources; tourist destinations; Electronic publishing; Encyclopedias; Feature extraction; Internet; Ontologies; Semantics; Feature discovery; Information Extraction; Ontologies; Wikipedia
Conference_Title :
Semantic Media Adaptation and Personalization (SMAP), 2011 Sixth International Workshop on
Conference_Location :
Pontevedra
Print_ISBN :
978-1-4577-1372-9
DOI :
10.1109/SMAP.2011.13