DocumentCode :
3324983
Title :
Index Design for Dynamic Personalized PageRank
Author :
Pathak, Amit ; Chakrabarti, Soumen ; Gupta, Manish
Author_Institution :
IIT Bombay, Mumbai
fYear :
2008
fDate :
7-12 April 2008
Firstpage :
1489
Lastpage :
1491
Abstract :
Personalized page rank, related to random walks with restarts and conductance in resistive networks, is a frequent search paradigm for graph-structured databases. While efficient batch algorithms exist for static whole-graph page rank, interactive query-time personalized page rank has proved more challenging. Here we describe how to select and build indices for a popular class of page rank algorithms, so as to provide real-time personalized page rank and smoothly trade off between index size, preprocessing time, and query speed. We achieve this by developing a precise, yet efficiently estimated performance model for personalized page rank query execution. We use this model in conjunction with a query workload in a cost-benefit type index optimizer. On millions of queries from CiteSeer and its data graphs with 74-320 thousand nodes, our algorithm runs 50-400 x faster than whole-graph page rank, the gap growing with graph size. Index size is 10-20% of a text index. Ranking accuracy is above 94%.
Keywords :
database theory; graph theory; query processing; cost-benefit type index optimizer; graph-structured databases; index design; interactive query-time personalized page rank; random walks; resistive networks; search paradigm; Costs; Databases; Indexes; Joining processes; Legged locomotion; Predictive models; Query processing; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
Type :
conf
DOI :
10.1109/ICDE.2008.4497599
Filename :
4497599
Link To Document :
بازگشت