DocumentCode
2985317
Title
An Ellipsoidal K-Means for Document Clustering
Author
Dzogang, F. ; Marsala, Christophe ; Lesot, M. ; Rifqi, Maria
Author_Institution
LIP6, Univ. Pierre et Marie Curie - Paris 6, Paris, France
fYear
2012
fDate
10-13 Dec. 2012
Firstpage
221
Lastpage
230
Abstract
We propose an extension of the spherical K-means algorithm to deal with settings where the number of data points is largely inferior to the number of dimensions. We assume the data to lie in local and dense regions of the original space and we propose to embed each cluster into its specific ellipsoid. A new objective function is introduced, analytical solutions are derived for both the centroids and the associated ellipsoids. Furthermore, a study on the complexity of this algorithm highlights that it is of same order as the regular K-means algorithm. Results on both synthetic and real data show the efficiency of the proposed method.
Keywords
computational complexity; document handling; pattern clustering; algorithm complexity; document clustering; ellipsoid; ellipsoidal k-means; spherical k-means algorithm; Clustering algorithms; Ellipsoids; Feature extraction; Linear programming; Partitioning algorithms; Tuning; Vectors; clustering; feature selection; information retrieval; spherical k-means;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location
Brussels
ISSN
1550-4786
Print_ISBN
978-1-4673-4649-8
Type
conf
DOI
10.1109/ICDM.2012.126
Filename
6413900
Link To Document