DocumentCode :
2754311
Title :
TC-PageRank Algorithm Based on Topic Correlation
Author :
Huang, Decai ; Qi, Huachun ; Yuan, Yuan ; Zheng, Yue-feng
Author_Institution :
Coll. of Inf. Eng., Zhejiang Univ. of Technol., Hangzhou
Volume :
2
fYear :
0
fDate :
0-0 0
Firstpage :
5943
Lastpage :
5946
Abstract :
PageRank algorithm is a famous algorithm to mine the Web structure, but it has a drawback of topic-drift. To eliminate the topic-drift of the PageRank algorithm, and after the analysis of existing algorithms, a new algorithm called TC-PageRank algorithm is put forward. The TC-PageRank algorithm is based on fictitious file vector and correlation measure of cosine. Experimental results illustrate that TC-PageRank algorithm eliminates the topic-drift phenomenon effectively, and thus improves the quality of retrieving
Keywords :
Internet; correlation methods; data mining; information retrieval; TC-PageRank; Web structure mining; cosine correlation measure; fictitious file vector; topic correlation; topic-drift drawback; Algorithm design and analysis; Automobiles; Classification algorithms; Educational institutions; Internet; Prototypes; Search engines; Turning; Web sites; Hyperlink Analysis; PageRank Algorithm; Topic Correlation; Web Structure Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation, 2006. WCICA 2006. The Sixth World Congress on
Conference_Location :
Dalian
Print_ISBN :
1-4244-0332-4
Type :
conf
DOI :
10.1109/WCICA.2006.1714219
Filename :
1714219
Link To Document :
بازگشت