• DocumentCode
    1801581
  • Title

    Using Web Knowledge to Improve the Wrapping of Web Sources

  • Author

    Kabisch, Thomas ; Padur, Ronald ; Rother, Dirk

  • Author_Institution
    University of Technology Berlin
  • fYear
    2006
  • fDate
    2006
  • Firstpage
    4
  • Lastpage
    4
  • Abstract
    During the wrapping of web interfaces, ontological knowledge is important in order to support an automated interpretation of information. The development of ontologies is time-consuming and not realistic in global contexts. On the other hand, the web provides a huge amount of knowledge, which can be used instead of ontologies. Three common classes of web knowledge sources are web thesauri, search engines and web encyclopedias. The paper investigates how web knowledge can be utilized to solve three semantic problems: parameter finding for query interfaces, labeling of values, and relabeling after interface evolution. To solve the parameter finding problem, an algorithm has been implemented that uses the web encyclopedia Wikipedia for the initial identification of parameter value candidates and the search engine Google to validate label-value relationships. The approach has been integrated into a wrapper definition framework.
  • Keywords
    Databases; Encyclopedias; Humans; Labeling; Ontologies; Search engines; Thesauri; Web pages; Wikipedia; Wrapping;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering Workshops, 2006. Proceedings. 22nd International Conference on
  • Conference_Location
    Atlanta, GA, USA
  • Print_ISBN
    0-7695-2571-7
  • Type

    conf

  • DOI
    10.1109/ICDEW.2006.160
  • Filename
    1623799