Title :
An Extension of the K-Mean Algorithm for dealing with Dimensionality Curse
Author :
Qaiyumi, Sayed Waleed ; Mirikitani, Derrick Takeshi
Author_Institution :
Goldsmiths Coll., London Univ.
Abstract :
Multi-attribute clustering and dimensionality reduction is achieved through a generalization of the k-mean algorithm. Entropy analysis is utilized to reduce dimensionality and to find initial conditions or starting points for the extended k-mean algorithm. We start with the deciding criteria attribute, and based on comparison and selection of the surrounding points, inter value comparison and gain assessment is performed leading to optimal value selection. Dwarf data cubes are utilized to mitigate the effects of the curse of dimensionality, specifically in a database environment where suffixes and prefixes are present. Finally the use of the entropy generated result for the initial conditioning of the extended k-mean algorithm is explained
Keywords :
entropy; pattern clustering; dimensionality curse; dimensionality reduction; dwarf data cubes; entropy analysis; gain assessment; inter value comparison; k-mean algorithm; multiattribute clustering; optimal value selection; Algorithm design and analysis; Clustering algorithms; Costs; Databases; Educational institutions; Entropy; Euclidean distance; Machine learning algorithms; Performance gain; Remuneration;
Conference_Titel :
Engineering of Intelligent Systems, 2006 IEEE International Conference on
Conference_Location :
Islamabad
Print_ISBN :
1-4244-0456-8
DOI :
10.1109/ICEIS.2006.1703221