• DocumentCode
    708633
  • Title

    Clustering of large data based on the relational analysis

  • Author

    Slaoui, Said Chah ; Lamari, Yasmine

  • Author_Institution
    Comput. Sci. Res. Lab., Mohammed V Univ., Rabat, Morocco
  • fYear
    2015
  • fDate
    25-26 March 2015
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    This paper presents a fast heuristic which finds clusters by partitioning categorical large data sets according to the Relational Analysis, whereby the cluster analysis is modeled as a linear integer program with n2 attributes (n is the number of observations) and solved by the optimization under constraints of the Condorcet criterion. Without neither a sampling method nor the fixing of input parameters and while using a natural cluster structure, Transitive heuristic needs a small amount of memory and a short time to provide good quality partition. Experimental results on real and synthetic data sets are presented in order to show that clusters, formed using this technique, are intensive and accurate.
  • Keywords
    integer programming; linear programming; pattern clustering; statistical analysis; Condorcet criterion; categorical large data sets; cluster analysis; large data clustering; linear integer program; natural cluster structure; relational analysis; Algorithm design and analysis; Clustering algorithms; Generators; Heuristic algorithms; Linear programming; Partitioning algorithms; Sampling methods; Categorical data; Cluster analysis; Condorcet criterion; Partitioning heuristic; Relational Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems and Computer Vision (ISCV), 2015
  • Conference_Location
    Fez
  • Print_ISBN
    978-1-4799-7510-5
  • Type

    conf

  • DOI
    10.1109/ISACV.2015.7105550
  • Filename
    7105550