• DocumentCode
    3099187
  • Title

    A New Extraction Concept Based on Contextual Clustering

  • Author

    Karoui, Lobna ; Aufaure, Marie-Aude ; Bennacer, Nacera

  • Author_Institution
    Ecole Super. d ´´Electricite, Gif-sur-Yvette
  • fYear
    2006
  • fDate
    Nov. 28 2006-Dec. 1 2006
  • Firstpage
    91
  • Lastpage
    91
  • Abstract
    Ontologies provide a common layer that plays a major role in information exchange and support sharing. Ontologies proliferation relies strongly on the automation of their building, integration and deployment processes. In this paper, we present an integrated framework involving complementary dimensions to drive the (semi) automatic acquisition conceptual knowledge process from HTML Web pages. Our approach takes advantage from structural HTML document features and the word location to identify the appropriate term context. Our context definition improves word weighting, the selection of the semantically closer cooccurrents and the relevant extracted ontological concepts. We use an unsupervised clustering method for term groups´ generation. Notice that the chosen clustering method relies on a user incremental quality evaluation process. In this paper and after a theoretical presentation of our structural contextual definition, we summarize the most significant results obtained by applying our method on a corpus dedicated to the tourism domain. The first results show how the definition of an appropriate context improves the relevance of the extracted concepts.
  • Keywords
    Web sites; hypermedia markup languages; ontologies (artificial intelligence); semantic Web; HTML Web pages; HTML document; contextual clustering; information exchange; ontologies proliferation; semantic Web; unsupervised clustering method; user incremental quality evaluation process; Automation; Buildings; Clustering methods; Computational intelligence; Data mining; HTML; Logic programming; Ontologies; Semantic Web; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence for Modelling, Control and Automation, 2006 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    0-7695-2731-0
  • Type

    conf

  • DOI
    10.1109/CIMCA.2006.19
  • Filename
    4052728