• DocumentCode
    2617672
  • Title

    An Extension of the K-Mean Algorithm for dealing with Dimensionality Curse

  • Author

    Qaiyumi, Sayed Waleed ; Mirikitani, Derrick Takeshi

  • Author_Institution
    Goldsmiths Coll., London Univ.
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Multi-attribute clustering and dimensionality reduction is achieved through a generalization of the k-mean algorithm. Entropy analysis is utilized to reduce dimensionality and to find initial conditions or starting points for the extended k-mean algorithm. We start with the deciding criteria attribute, and based on comparison and selection of the surrounding points, inter value comparison and gain assessment is performed leading to optimal value selection. Dwarf data cubes are utilized to mitigate the effects of the curse of dimensionality, specifically in a database environment where suffixes and prefixes are present. Finally the use of the entropy generated result for the initial conditioning of the extended k-mean algorithm is explained
  • Keywords
    entropy; pattern clustering; dimensionality curse; dimensionality reduction; dwarf data cubes; entropy analysis; gain assessment; inter value comparison; k-mean algorithm; multiattribute clustering; optimal value selection; Algorithm design and analysis; Clustering algorithms; Costs; Databases; Educational institutions; Entropy; Euclidean distance; Machine learning algorithms; Performance gain; Remuneration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering of Intelligent Systems, 2006 IEEE International Conference on
  • Conference_Location
    Islamabad
  • Print_ISBN
    1-4244-0456-8
  • Type

    conf

  • DOI
    10.1109/ICEIS.2006.1703221
  • Filename
    1703221