DocumentCode
2933131
Title
Exploiting Routing Information Encoded into Backlinks to Improve Topical Crawling
Author
Mouton, Alban ; Marteau, Pierre-Francois
Author_Institution
Valoria, Eur. Univ. of Brittany, Vannes, France
fYear
2009
fDate
4-7 Dec. 2009
Firstpage
659
Lastpage
664
Abstract
Local link analysis of topical graphs on the Web makes it possible to experiment with focused crawling strategies in detail. In this setting, the models, parameters and metrics used to steer the crawler can be better understood, tuned and evaluated. We develop a methodological and experimental approach that exploits link analysis to determine what constitutes a good content analysis metric, one able to guide topical crawlers efficiently toward highly relevant areas of the Web. Our experiments show that partial knowledge of the local topology of a topical graph improves our understanding of the routing capabilities of various metrics. Furthermore, they demonstrate that a significant improvement in crawling efficiency can be achieved.
Keywords
Internet; graph theory; telecommunication network routing; telecommunication network topology; Web; backlinks; content analysis metric; link analysis; local link analysis; local topology; routing capabilities; routing information; topical crawling; topical graph; Computer applications; Constraint optimization; Containers; Design optimization; Integer linear programming; Laboratories; Pattern recognition; Printing; Routing; Testing; Web topology
fLanguage
English
Publisher
IEEE
Conference_Titel
Soft Computing and Pattern Recognition, 2009. SOCPAR '09. International Conference of
Conference_Location
Malacca
Print_ISBN
978-1-4244-5330-6
Electronic_ISBN
978-0-7695-3879-2
Type
conf
DOI
10.1109/SoCPaR.2009.129
Filename
5370352