• DocumentCode
    2348463
  • Title

    An Efficient Dimension Reduction and Optimal Cluster Center Initialization Technique

  • Author

    Rajput, Dharmveer Singh ; Singh, Praveen Kumar ; Bhattacharya, Mahua

  • Author_Institution
    ABV, Indian Inst. of Inf. Technol. & Manage., Gwalior, India
  • fYear
    2010
  • fDate
    26-28 Nov. 2010
  • Firstpage
    503
  • Lastpage
    508
  • Abstract
    Most of the clustering algorithms perform loosely when dimensionality of the data set increase because some dimensions contain irrelevant or noisy data and randomly initialization of clusters centres gives the local optimum clustering. In this paper, we proposed a technique for reducing the effect of high dimensionality and randomly initialization of clusters centres. It consists of three phases. In first phase, the standard deviation is used to select the meaningful dimensions from high dimensional data set. In second phase, the selected dimensions produce the k initial centres by adding and subtracting the constant from its grand mean and those initial cluster centres are used in the k-means to find optimal clustering of data set in the third phase. Empirical results have shown its favorable performance in comparison with standard k-means clustering algorithms.
  • Keywords
    data compression; data handling; pattern clustering; dimension reduction; k-means clustering algorithms; optimal cluster center initialization technique; standard deviation; Feature selection; High dimensional data; K-Means; Subspace;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Communication Networks (CICN), 2010 International Conference on
  • Conference_Location
    Bhopal
  • Print_ISBN
    978-1-4244-8653-3
  • Electronic_ISBN
    978-0-7695-4254-6
  • Type

    conf

  • DOI
    10.1109/CICN.2010.100
  • Filename
    5702022