Title :
Model Based Unsupervised Learning Guided by Abundant Background Samples
Author :
Mahdi, Rami N. ; Rouchka, Eric C.
Author_Institution :
Dept. of Comput. Eng. & Comput. Sci., Univ. of Louisville, Louisville, KY
Abstract :
Many data sets contain an abundance of background data or samples belonging to classes not currently under consideration. We present a new unsupervised learning method based on fuzzy c-means to learn sub models of a class using background samples to guide cluster split and merge operations. The proposed method demonstrates how background samples can be used to guide and improve the clustering process. The proposed method results in more accurate clusters and helps to escape locally minimum solutions. In addition, the number of clusters is determined for the class under consideration. The method demonstrates remarkable performance on both synthetic 2D and real world data from the MNIST dataset of hand written digits.
Keywords :
data analysis; fuzzy set theory; pattern clustering; unsupervised learning; background data; background samples; class submodel learning; cluster split operation; fuzzy c-means; merge operation; model based unsupervised learning; Application software; Bioinformatics; Clustering algorithms; Computer science; Feedback; Landmine detection; Learning systems; Machine learning; Pattern recognition; Unsupervised learning; FCM; background samples; clustering; unsupervised learning;
Conference_Titel :
Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-0-7695-3495-4
DOI :
10.1109/ICMLA.2008.28