DocumentCode
3321729
Title
Matching Schemas in Online Communities: A Web 2.0 Approach
Author
McCann, Robert ; Shen, Warren ; Doan, AnHai
Author_Institution
Microsoft Corp., Redmond, WA
fYear
2008
fDate
7-12 April 2008
Firstpage
110
Lastpage
119
Abstract
When integrating data from multiple sources, a key task that online communities often face is to match the schemas of the data sources. Today, such matching often incurs a huge workload that overwhelms the relatively small set of volunteer integrators. In such cases, community members may not even volunteer to be integrators, due to the high workload, and consequently no integration systems can be built. To address this problem, we propose to enlist the multitude of users in the community to help match the schemas, in a Web 2.0 fashion. We discuss the challenges of this approach and provide initial solutions. Finally, we describe an extensive set of experiments on both real-world and synthetic data that demonstrate the utility of the approach.
Keywords
Internet; Web 2.0 approach; data integration; data source matching schema; online community; Bioinformatics; Deductive databases; Fans; Feedback; Humans; Lakes; Motion pictures; Robustness
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-1836-7
Electronic_ISBN
978-1-4244-1837-4
Type
conf
DOI
10.1109/ICDE.2008.4497419
Filename
4497419