• DocumentCode
    3118879
  • Title

    Fuzzy clustering of large-scale data sets using Principal Component Analysis

  • Author

    Arfaoui, Olfa ; Sassi, Minyar

  • Author_Institution
    Nat. Eng. Sch. of Tunis, Tunis, Tunisia
  • fYear
    2011
  • fDate
    27-30 June 2011
  • Firstpage
    683
  • Lastpage
    690
  • Abstract
    To effectively exploit large-scale data sets using a limited storage space, it is necessary to find a special treatment which reduces them. There are certain methods with this intention. We can quote clustering method. However, this method proves its limits in the case of large-scale data sets. In this paper, we propose to reduce the workspace using the Principal Component Analysis (PCA). We work with fuzzy clustering of a data set in which users don´t know the optimal number of clusters to be generated. We proved the effectiveness of the pre-processing use of this technique before any clustering operation.
  • Keywords
    data compression; data mining; pattern clustering; principal component analysis; fuzzy clustering; large-scale data sets; principal component analysis; quote clustering method; storage space; Clustering algorithms; Correlation; Covariance matrix; Eigenvalues and eigenfunctions; Equations; Mathematical model; Principal component analysis; Clustering; Data Compression; Fuzzy Logic; Principal Component Analysis; Sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4244-7315-1
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZY.2011.6007435
  • Filename
    6007435