DocumentCode :
1918690
Title :
Web Clustering Using Social Bookmarking Data with Dimension Reduction Regarding Similarity
Author :
Yanagimoto, Hidekazu ; Yoshioka, Michifumi ; Omatu, Sigeru
Author_Institution :
Sch. of Eng., Osaka Prefecture Univ., Sakai, Japan
fYear :
2010
fDate :
9-11 Aug. 2010
Firstpage :
386
Lastpage :
390
Abstract :
We propose a web clustering method using social bookmarking data with dimension reduction regarding similarity. To realize this idea we construct the similarity matrix between web pages based on their cooccurrence frequency. Since the similarity matrix includes various kind of noise, we map the similarity matrix onto lower dimension feature space to reduce the noise. Especially we carry out dimension reduction regarding web pages´ similarity. This approach uses generalized eigenvectors and is different from usual eigenvalue problems. Using artificially generated data, we explain that the feature space constructed with our proposed method emphasizes the essential relationship between web pages. And using real social bookmarking data, we describe our proposed method can make good clusters.
Keywords :
Internet; eigenvalues and eigenfunctions; pattern clustering; Web clustering; Web pages similarity; cooccurrence frequency; dimension feature space; dimension reduction; eigenvalue problem; generalized eigenvectors; similarity matrix; social bookmarking data; Eigenvalues and eigenfunctions; Equations; Kernel; Mathematical model; Principal component analysis; Registers; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2010 International Conference on
Conference_Location :
Odense
Print_ISBN :
978-1-4244-7787-6
Electronic_ISBN :
978-0-7695-4138-9
Type :
conf
DOI :
10.1109/ASONAM.2010.95
Filename :
5563080
Link To Document :
بازگشت