DocumentCode
708633
Title
Clustering of large data based on the relational analysis
Author
Slaoui, Said Chah ; Lamari, Yasmine
Author_Institution
Comput. Sci. Res. Lab., Mohammed V Univ., Rabat, Morocco
fYear
2015
fDate
25-26 March 2015
Firstpage
1
Lastpage
7
Abstract
This paper presents a fast heuristic which finds clusters by partitioning categorical large data sets according to the Relational Analysis, whereby the cluster analysis is modeled as a linear integer program with n2 attributes (n is the number of observations) and solved by the optimization under constraints of the Condorcet criterion. Without neither a sampling method nor the fixing of input parameters and while using a natural cluster structure, Transitive heuristic needs a small amount of memory and a short time to provide good quality partition. Experimental results on real and synthetic data sets are presented in order to show that clusters, formed using this technique, are intensive and accurate.
Keywords
integer programming; linear programming; pattern clustering; statistical analysis; Condorcet criterion; categorical large data sets; cluster analysis; large data clustering; linear integer program; natural cluster structure; relational analysis; Algorithm design and analysis; Clustering algorithms; Generators; Heuristic algorithms; Linear programming; Partitioning algorithms; Sampling methods; Categorical data; Cluster analysis; Condorcet criterion; Partitioning heuristic; Relational Analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems and Computer Vision (ISCV), 2015
Conference_Location
Fez
Print_ISBN
978-1-4799-7510-5
Type
conf
DOI
10.1109/ISACV.2015.7105550
Filename
7105550
Link To Document