Title of article :
DistanceRank: An intelligent ranking algorithm for web pages
Author/Authors :
Ali Mohammad Zareh Bidoki، نويسنده , , Nasser Yazdani، نويسنده ,
Issue Information :
دوماهنامه با شماره پیاپی سال 2008
Pages :
16
From page :
877
To page :
892
Abstract :
A fast and efficient page ranking mechanism for web crawling and retrieval remains as a challenging issue. Recently, several link based ranking algorithms like PageRank, HITS and OPIC have been proposed. In this paper, we propose a novel recursive method based on reinforcement learning which considers distance between pages as punishment, called “DistanceRank” to compute ranks of web pages. The distance is defined as the number of “average clicks” between two pages. The objective is to minimize punishment or distance so that a page with less distance to have a higher rank. Experimental results indicate that DistanceRank outperforms other ranking algorithms in page ranking and crawling scheduling. Furthermore, the complexity of DistanceRank is low. We have used University of California at Berkeley’s web for our experiments.
Keywords :
Crawling , Web graph , reinforcement learning , Web ranking
Journal title :
Information Processing and Management
Serial Year :
2008
Journal title :
Information Processing and Management
Record number :
1228772
Link To Document :
بازگشت