DocumentCode :
2579004
Title :
A Novel URL Assignment Model Based on Multi-objective Decision Making Method
Author :
Huang, Qiuyan ; Li, Qingzhong ; Yan, Zhongmin
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear :
2012
fDate :
16-18 Nov. 2012
Firstpage :
31
Lastpage :
34
Abstract :
With the tremendous growth of the Web, it has become a huge challenge for the single-process crawlers to locate the resources that are precise and relevant to some topics in an appropriate amount of time, so it is increasingly important to use the parallel crawler. However, due to the parallelism of crawlers, one headache problem we have to face is how to distribute the URLs to crawlers to make the parallel system work coordinately and thereby make sure that the Web pages fetched are of high quality. In this paper, a novel URL assignment model for the parallel crawler is described, which is based on multi-objective decision making method and considers multiple factors synthetically such as load balance, overlap and so on. Extensive experiments test and validate our techniques.
Keywords :
Web sites; decision making; information retrieval; resource allocation; URL assignment model; Web pages; load balance; multiobjective decision making method; parallel crawler; single-process crawlers; Computational modeling; Computer architecture; Crawlers; Decision making; Load modeling; Loading; Web pages; URL assignment model; multi-objective decision making method; parallel crawler; web crawler;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2012 Ninth
Conference_Location :
Haikou
Print_ISBN :
978-1-4673-3054-1
Type :
conf
DOI :
10.1109/WISA.2012.19
Filename :
6385178
Link To Document :
بازگشت