Title :
Method of Data Clustering Incomplete Fill Based on Constraint Tolerance Set Dissimilarity
Author_Institution :
Beijing Inf. Technol. Coll., Beijing, China
Abstract :
In the field of data mining, clustering with loss data is one of the common problems. The traditional approach is based on a probability model to predict the loss data for prediction filling, the limitations of such methods is not suitable for the processing of large amount of data. An improved data clustering incomplete fill algorithm is proposed based on constraint tolerance set dissimilarity, this method does not use the probability hypothesis, and the characteristics of the loss data are used directly. The overall dissimilarity analysis is taken with the set mode, the constraint tolerance difference and the corresponding data object are simplified, the thematic analysis of data set is combined, and the loss data is filled. The baseline data set is used as the experiment sample, the simulation result shows that this method has higher processing speed, the accuracy is improved. It has good application value in data clustering research.
Keywords :
data mining; pattern clustering; probability; constraint tolerance set dissimilarity; data mining; dissimilarity analysis; improved data clustering incomplete fill algorithm; loss data prediction; prediction filling; probability hypothesis; Algorithm design and analysis; Clustering algorithms; Data mining; Data models; Filling; Single photon emission computed tomography; Standards; Data mining; Clustering data; Constraint tolerance difference; Data fill;
Conference_Titel :
Intelligent Systems Design and Engineering Applications (ISDEA), 2014 Fifth International Conference on
Conference_Location :
Hunan
Print_ISBN :
978-1-4799-4262-6
DOI :
10.1109/ISDEA.2014.144