DocumentCode :
2476632
Title :
A unified expanding method for content-ignorant web page clustering
Author :
Chen, Chen
Author_Institution :
Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing
fYear :
2008
fDate :
25-27 June 2008
Firstpage :
633
Lastpage :
638
Abstract :
The content-ignorant clustering method takes advantages in time complexity and space complexity.. than the content based methods. In this paper, the authors introduce a unified expanding method for content-ignorant Web page clustering by mining the ldquoclickthroughrdquo log, which tries to solve the problem that the ldquoclickthroughrdquo log is sparse. The relationship between two nodes which have been expanded is also defined and optimized. Analysis and experiment show that the performance of the new method has improved, by the comparison with the standard content-ignorant method. The new method can also work without iterative clustering.
Keywords :
Internet; computational complexity; data mining; pattern clustering; clickthrough log; content-ignorant Web page clustering; mining; space complexity; time complexity; unified expanding method; Automation; Bismuth; Clustering methods; Computer science; Data mining; Intelligent control; Optimization methods; Performance analysis; Uniform resource locators; Web pages; clustering; content-ignorant clustering; web data mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
Type :
conf
DOI :
10.1109/WCICA.2008.4592996
Filename :
4592996
Link To Document :
بازگشت