DocumentCode :
3413995
Title :
DeepSearcher: A One-Time Searcher for Deep Web
Author :
Shen, Derong ; Sun, Gaoshang ; Nie, Tiezheng ; Kou, Yue
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Volume :
3
fYear :
2009
fDate :
12-14 Aug. 2009
Firstpage :
273
Lastpage :
277
Abstract :
The proliferation of database-driven Web sites has made user pay more effort for selecting the best satisfying results. Therefore, we propose a searching system named as DeepSearcher to meet userpsilas need, which includes offline processing (e.g. pre-processing) and online processing. The latter consists of query processor, result integrator, cache subsystem and service portal. To implement the system, key techniques such as subject-based classification, clustering-based result extraction and schema recognition, dominant attribute-based data sources ranking, query relaxation, duplicate identification and result top-k are adopted to support the searching system. The demonstration shows the feasibility and the promise of DeepSearcher.
Keywords :
Web sites; cache storage; pattern classification; pattern clustering; portals; query processing; search engines; DeepSearcher; cache subsystem; clustering-based result extraction; data sources ranking; database-driven Web sites; dominant attribute; duplicate identification; offline processing; one-time searcher; online processing; query processor; query relaxation; result integrator; schema recognition; service portal; subject-based classification; top-k; Books; Crawlers; Data mining; Databases; Educational institutions; Hybrid intelligent systems; Marketing and sales; Portals; Sun; Web pages; data extraction; data integration; deep web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hybrid Intelligent Systems, 2009. HIS '09. Ninth International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-0-7695-3745-0
Type :
conf
DOI :
10.1109/HIS.2009.270
Filename :
5254581
Link To Document :
بازگشت