• DocumentCode
    3321729
  • Title

    Matching Schemas in Online Communities: A Web 2.0 Approach

  • Author

    McCann, Robert ; Shen, Warren ; Doan, AnHai

  • Author_Institution
    Microsoft Corp., Redmond, WA
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    110
  • Lastpage
    119
  • Abstract
    When integrating data from multiple sources, a key task that online communities often face is to match the schemas of the data sources. Today, such matching often incurs a huge workload that overwhelms the relatively small set of volunteer integrators. In such cases, community members may not even volunteer to be integrators, due to the high workload, and consequently no integration systems can be built. To address this problem, we propose to enlist the multitude of users in the community to help match the schemas, in a Web 2.0 fashion. We discuss the challenges of this approach and provide initial solutions. Finally, we describe an extensive set of experiments on both real-world and synthetic data that demonstrate the utility of the approach.
  • Keywords
    Internet; Web 2.0 approach; data integration; data source matching schema; online community; Bioinformatics; Deductive databases; Fans; Feedback; Humans; Lakes; Motion pictures; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-1836-7
  • Electronic_ISBN
    978-1-4244-1837-4
  • Type

    conf

  • DOI
    10.1109/ICDE.2008.4497419
  • Filename
    4497419