• DocumentCode
    2985317
  • Title

    An Ellipsoidal K-Means for Document Clustering

  • Author

    Dzogang, F. ; Marsala, Christophe ; Lesot, M. ; Rifqi, Maria

  • Author_Institution
    LIP6, Univ. Pierre et Marie Curie - Paris 6, Paris, France
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    221
  • Lastpage
    230
  • Abstract
    We propose an extension of the spherical K-means algorithm to deal with settings where the number of data points is largely inferior to the number of dimensions. We assume the data to lie in local and dense regions of the original space and we propose to embed each cluster into its specific ellipsoid. A new objective function is introduced, analytical solutions are derived for both the centroids and the associated ellipsoids. Furthermore, a study on the complexity of this algorithm highlights that it is of same order as the regular K-means algorithm. Results on both synthetic and real data show the efficiency of the proposed method.
  • Keywords
    computational complexity; document handling; pattern clustering; algorithm complexity; document clustering; ellipsoid; ellipsoidal k-means; spherical k-means algorithm; Clustering algorithms; Ellipsoids; Feature extraction; Linear programming; Partitioning algorithms; Tuning; Vectors; clustering; feature selection; information retrieval; spherical k-means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.126
  • Filename
    6413900