• DocumentCode
    495250
  • Title

    Correlated-Clustering Frame: A Holistic Method of Deep Web Schema Matching Based on Data Mining

  • Author

    Yuchen, Fu ; Quan, Liu ; Yunlong, Xu ; Chao, Zhang ; Wenyun, Zhou ; Zhiming, Cui

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
  • Volume
    5
  • fYear
    2009
  • fDate
    March 31 2009-April 2 2009
  • Firstpage
    528
  • Lastpage
    533
  • Abstract
    A large number of deep Web data sources are only accessible through their query interfaces. For any domain of interest, there may be many such sources with varied query capabilities and content coverage. To obtain mass valuable information in deep Web, we need to integrate large heterogeneous information. Schema matching is a critical problem in the integration process. This paper propose a new holistic schema matching method based on data mining, named as correlated-clustering, which mines positively correlated attributes to form potential attribute groups, and finds synonym attributes by clustering. We design experiments to implement mentioned algorithms and technology. Experimental results testify that our solution achieves accurately and effectively.
  • Keywords
    Internet; content management; data mining; pattern clustering; query processing; content coverage; correlated clustering; data mining; data sources; deep Web; holistic schema matching; large heterogeneous information; query interface; synonym attributes; Algorithm design and analysis; Books; Chaos; Computer science; Data engineering; Data mining; Databases; Large-scale systems; Measurement standards; Standards development;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Engineering, 2009 WRI World Congress on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3507-4
  • Type

    conf

  • DOI
    10.1109/CSIE.2009.886
  • Filename
    5170591