Title :
Matching Schemas in Online Communities: A Web 2.0 Approach
Author :
McCann, Robert ; Shen, Warren ; Doan, AnHai
Author_Institution :
Microsoft Corp., Redmond, WA
Abstract :
When integrating data from multiple sources, a key task that online communities often face is to match the schemas of the data sources. Today, such matching often incurs a huge workload that overwhelms the relatively small set of volunteer integrators. In such cases, community members may not even volunteer to be integrators, due to the high workload, and consequently no integration systems can be built. To address this problem, we propose to enlist the multitude of users in the community to help match the schemas, in a Web 2.0 fashion. We discuss the challenges of this approach and provide initial solutions. Finally, we describe an extensive set of experiments on both real-world and synthetic data that demonstrate the utility of the approach.
Keywords :
Internet; Web 2.0 approach; data integration; data source matching schema; online community; Bioinformatics; Deductive databases; Fans; Feedback; Humans; Lakes; Motion pictures; Robustness;
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
DOI :
10.1109/ICDE.2008.4497419