• DocumentCode
    2224567
  • Title

    Odaies: ontology-driven adaptive Web information extraction system

  • Author

    Zhang, Ning ; Chen, Hong ; Wang, Yu ; Cheng, Shi-Jun ; Xiong, Ming-Feng

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Peking Univ., Beijing, China
  • fYear
    2003
  • fDate
    13-16 Oct. 2003
  • Firstpage
    454
  • Lastpage
    460
  • Abstract
    This paper proposes an ontology-driven self-adapting approach in the semi-structured Web information extraction field, where ontology provides semantic support and plays a central role during the extraction process. It outperforms traditional wrapper systems in adaptiveness and maintenance. Firstly, we build a domain-dependant ontology. Then we design three templates generating algorithms, which have self-adaptiveness and self-maintenance based on the ontology, to perform Web page information extraction. Experiment results show that our prototype system can achieve 100% recall and 97.64% precision.
  • Keywords
    Web sites; adaptive systems; data mining; information retrieval; Odaies; Web page; World Wide Web; domain-dependent ontology; knowledge discovery; ontology driven adaptive Web information extraction system; semantic support; wrapper system; Algorithm design and analysis; Computer science; Data mining; HTML; Hazards; Heuristic algorithms; Ontologies; Prototypes; Web pages; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on
  • Print_ISBN
    0-7695-1931-8
  • Type

    conf

  • DOI
    10.1109/IAT.2003.1241120
  • Filename
    1241120