• DocumentCode
    604424
  • Title

    Interface schema matching with the machine learning for deep web

  • Author

    Guanwen Zhu ; Hongbin Wang ; Nianbin Wang ; Qianqian Jiao

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin, China
  • fYear
    2012
  • fDate
    29-31 Dec. 2012
  • Firstpage
    822
  • Lastpage
    825
  • Abstract
    With the rapid development of the World Wide Web, information contained in the deep web is increasing dramatically. Since different query interfaces are heterogeneous and autonomous inherently, even in the same domain, it is a huge challenge to allow users efficiently and quickly to get their own satisfying information. Deep web query interfaces integration can solve this problem well. The interface schema matching is the foremost step in the steps of the deep web query interfaces integration. This paper takes 120 data sources as a training set and 40 data sources as a testing set. Combined with the idea of multi-strategy learning technology, a deep web interface schema matching method based on machine learning is proposed. The method transformed the schema matching problem into the machine learning classification, and achieved the schema matching automatically. In order to enhance the accuracy of the mappings, the concept of domain ontology is introduced in this paper. The experimental results show that the method has an average accuracy rate of 80%-90%.
  • Keywords
    Internet; learning (artificial intelligence); ontologies (artificial intelligence); pattern classification; query processing; World Wide Web; automatic schema matching; autonomous query interfaces; deep Web query interface integration; domain ontology; heterogeneous query interfaces; interface schema matching; machine learning classification; mapping accuracy enhancement; multistrategy learning technology; deep web; domain ontology; meta-learning; multi-stratry learning; schema matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Network Technology (ICCSNT), 2012 2nd International Conference on
  • Conference_Location
    Changchun
  • Print_ISBN
    978-1-4673-2963-7
  • Type

    conf

  • DOI
    10.1109/ICCSNT.2012.6526056
  • Filename
    6526056