DocumentCode :
495250
Title :
Correlated-Clustering Frame: A Holistic Method of Deep Web Schema Matching Based on Data Mining
Author :
Yuchen, Fu ; Quan, Liu ; Yunlong, Xu ; Chao, Zhang ; Wenyun, Zhou ; Zhiming, Cui
Author_Institution :
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Volume :
5
fYear :
2009
fDate :
March 31 2009-April 2 2009
Firstpage :
528
Lastpage :
533
Abstract :
A large number of deep Web data sources are only accessible through their query interfaces. For any domain of interest, there may be many such sources with varied query capabilities and content coverage. To obtain mass valuable information in deep Web, we need to integrate large heterogeneous information. Schema matching is a critical problem in the integration process. This paper propose a new holistic schema matching method based on data mining, named as correlated-clustering, which mines positively correlated attributes to form potential attribute groups, and finds synonym attributes by clustering. We design experiments to implement mentioned algorithms and technology. Experimental results testify that our solution achieves accurately and effectively.
Keywords :
Internet; content management; data mining; pattern clustering; query processing; content coverage; correlated clustering; data mining; data sources; deep Web; holistic schema matching; large heterogeneous information; query interface; synonym attributes; Algorithm design and analysis; Books; Chaos; Computer science; Data engineering; Data mining; Databases; Large-scale systems; Measurement standards; Standards development;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
Type :
conf
DOI :
10.1109/CSIE.2009.886
Filename :
5170591
Link To Document :
بازگشت