Title :
DSSM: A Data Sources Selection Model for Deep Web
Author :
Qu, Zhendong ; Shen, Derong ; Yu, Ge ; Kou, Yue ; Nie, Tiezheng
Author_Institution :
Dept. of Comput. Sci. & Eng., Northeastern Univ., Shenyang, China
Abstract :
The deep Web data integration has become more and more important due to the large amount of deep Web data sources. Nevertheless, how to select the most relevant data sources on deep web is still a challenging issue. However, the existing strategies only focus on the data sources interfaces, which are not enough to select the best-effort data sources in the same domain. To solve this problem, an integrative data sources selection model named as DSSM is proposed in this paper, in which, the interface schema, the search mode, the contents in background databases, as well as the quality of data sources are considered together. So the model has the ability to select the best-effort data sources satisfying user queries. After carrying out a series of experiments on real-world sources, we demonstrate the effectiveness of the DSSM model.
Keywords :
Internet; data handling; query processing; DSSM model; background databases; data sources interfaces; deep Web data integration; deep Web data sources; integrative data sources selection model; user queries; Application software; Books; Computer interfaces; Computer science; Crawlers; Data engineering; Databases; Decision support systems; Delay; Information systems; DSSM; Deep Web; data source; instance; schema;
Conference_Titel :
Web Information Systems and Applications Conference, 2009. WISA 2009. Sixth
Conference_Location :
Xuzhou, Jiangsu
Print_ISBN :
978-0-7695-3874-7
DOI :
10.1109/WISA.2009.44