DocumentCode :
3099187
Title :
A New Extraction Concept Based on Contextual Clustering
Author :
Karoui, Lobna ; Aufaure, Marie-Aude ; Bennacer, Nacera
Author_Institution :
Ecole Super. d ´´Electricite, Gif-sur-Yvette
fYear :
2006
fDate :
Nov. 28 2006-Dec. 1 2006
Firstpage :
91
Lastpage :
91
Abstract :
Ontologies provide a common layer that plays a major role in information exchange and support sharing. Ontologies proliferation relies strongly on the automation of their building, integration and deployment processes. In this paper, we present an integrated framework involving complementary dimensions to drive the (semi) automatic acquisition conceptual knowledge process from HTML Web pages. Our approach takes advantage from structural HTML document features and the word location to identify the appropriate term context. Our context definition improves word weighting, the selection of the semantically closer cooccurrents and the relevant extracted ontological concepts. We use an unsupervised clustering method for term groups´ generation. Notice that the chosen clustering method relies on a user incremental quality evaluation process. In this paper and after a theoretical presentation of our structural contextual definition, we summarize the most significant results obtained by applying our method on a corpus dedicated to the tourism domain. The first results show how the definition of an appropriate context improves the relevance of the extracted concepts.
Keywords :
Web sites; hypermedia markup languages; ontologies (artificial intelligence); semantic Web; HTML Web pages; HTML document; contextual clustering; information exchange; ontologies proliferation; semantic Web; unsupervised clustering method; user incremental quality evaluation process; Automation; Buildings; Clustering methods; Computational intelligence; Data mining; HTML; Logic programming; Ontologies; Semantic Web; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence for Modelling, Control and Automation, 2006 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7695-2731-0
Type :
conf
DOI :
10.1109/CIMCA.2006.19
Filename :
4052728
Link To Document :
بازگشت