• DocumentCode
    3510918
  • Title

    Clustering Algorithm Based on Sparse Feature Vector for Interval-Scaled Variables

  • Author

    Wu, Sen ; Wei, Guiying ; Gu, Shujuan ; Ma, Xiaofang

  • Author_Institution
    Sch. of Econ. & Manage., Univ. of Sci. & Technol. of Beijing, Beijing
  • fYear
    2007
  • fDate
    21-25 Sept. 2007
  • Firstpage
    5561
  • Lastpage
    5564
  • Abstract
    A two-step algorithm, clustering algorithm based on sparse feature vector for interval-scaled variables (CABOSFV_I), is proposed for high dimensional sparse data clustering in this paper. It decomposes a high dimensional problem into several low dimensional ones in first step and then gets the final clusters by second clustering. Because the irrelevant attributes are removed from each cluster in first step, it diminishes the dimensions effectively. Furthermore, the algorithm compresses data effectively by using ´Sparse Feature Vector´. Data scale is reduced enormously, but clustering quality is not affected. Because of the effective dimension deduction and data compression, the algorithm finds clusters in high dimensional large datasets effectively and efficiently.
  • Keywords
    data mining; pattern clustering; clustering algorithm; clustering quality; data mining; interval-scaled variables; sparse data clustering; sparse feature vector; Association rules; Clustering algorithms; Clustering methods; Computational complexity; Data compression; Data mining; Discrete wavelet transforms; Inductors; Information analysis; Technology management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications, Networking and Mobile Computing, 2007. WiCom 2007. International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-1311-9
  • Type

    conf

  • DOI
    10.1109/WICOM.2007.1362
  • Filename
    4341137