DocumentCode :
527608
Title :
An incremental update strategy in Deep Web
Author :
Li, Hui ; Guo, Mei ; Cai, Liang ; Yang, Yanwu
Author_Institution :
Coll. of Inf., BUCT, Beijing, China
Volume :
1
fYear :
2010
fDate :
10-12 Aug. 2010
Firstpage :
131
Lastpage :
134
Abstract :
An effective incremental web crawler keeps a local repository of web pages up to date. In this paper, we introduce an approach to updating pages in the Deep Web. Unlike traditional studies, which mainly concentrate on “important pages” or “refresh”, we classify pages into categories that use different ways of calculating their respective priority measures, obtain the coefficient ratio between the derived categories through experimental statistics, and automatically adjust parameters to achieve the incremental update.
Keywords :
Internet; Web sites; classification; search engines; statistical analysis; Web pages; deep Web; experimental statistics; incremental Web crawler; incremental update strategy; pages classification; Books; Crawlers; Heuristic algorithms; Measurement; Navigation; Web pages; crawler; deep web; incremental update; url categories
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Natural Computation (ICNC), 2010 Sixth International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5958-2
Type :
conf
DOI :
10.1109/ICNC.2010.5583330
Filename :
5583330