Title :
Finding the WDB's Query Interface in Deep Web Automatically
Author :
Lin, Peiguang ; Xu, Ruzhi ; Hong, Zhimin ; Zhang, Yan
Author_Institution :
Sch. of Comput. & Inf. Eng., Shandong Univ. of Finance, Jinan
Abstract :
Web search engines work well for finding crawlable pages, but not for finding datasets hidden behind Web search forms. On this deep Web, many sources are structured, in that they provide structured query interfaces and results. Organizing such structured sources into a domain hierarchy that users can browse to find these valuable resources is one of the critical steps toward the large-scale integration of heterogeneous deep Web sources. We propose an automatic classification of structured deep Web sources based on the features available on their search interfaces. Our experimental data show that the method presented in this paper is practical and provides a sound basis for further research on the deep Web.
Keywords :
Web services; database management systems; query processing; search engines; Web search engines; deep Web sources; structured query interfaces; Computer interfaces; Costs; Databases; Finance; HTML; Internet; Organizing; Search engines; Statistical analysis; Web search; deep web; information retrieval; query interfaces
Conference_Title :
Internet Computing in Science and Engineering, 2008. ICICSE '08. International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-0-7695-3112-0
Electronic_ISBN :
978-0-7695-3112-0
DOI :
10.1109/ICICSE.2008.77