Title :
An Effective Method for Classification of High Dimensional Data
Author :
Lam, Benson S Y ; Yan, Hong
Author_Institution :
City Univ. of Hong Kong, Hong Kong
Abstract :
We study a new high dimensional data problem in this paper. In pattern classification, if many dimensions of two groups share a similar distribution, the classification error rates will be 50%. We have proposed a new clustering algorithm to deal with this problem. Its basic idea is to confine the support of the optimization equation so that the data points in one group can only have small contribution to the estimated cluster center in another group. Experiments show that the proposed method is able to yield good results in eight real world data sets and its performance is better than 10 existing methods.
Keywords :
learning (artificial intelligence); pattern classification; pattern clustering; classification error rates; clustering algorithm; high dimensional data; machine learning; optimization equation; pattern classification; Clustering algorithms; Cybernetics; Data engineering; Electronic mail; Equations; Error analysis; Handwriting recognition; Machine learning; Pattern classification; Shape; Calculus of variations; Classifcation of high imensional data; Clustering; Machine learning;
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
DOI :
10.1109/ICMLC.2007.4370608