• DocumentCode
    2959668
  • Title

    Efficient Quality Threshold Clustering for Parallel Architectures

  • Author

    Danalis, Anthony ; McCurdy, Collin ; Vetter, Jeffrey S.

  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    1068
  • Lastpage
    1079
  • Abstract
    Quality Threshold Clustering (QTC) is an algorithm for partitioning data, in fields such as biology, where clustering of large data-sets can aid scientific discovery. Unlike other clustering algorithms, QTC does not require knowing the number of clusters a priori, however, its perceived need for high computing power often makes it an unattractive choice. This paper presents a thorough study of QTC. We analyze the worst case complexity of the algorithm and discuss methods to reduce it by trading memory for computation. We also demonstrate how the expected running time of QTC is affected by the structure of the input data. We describe how QTC can be parallelized, and discuss implementation details of our thread-parallel, GPU, and distributed memory implementations of the algorithm. We demonstrate the efficiency of our implementations through experimental data. We show how data sets with tens of thousands of elements can be clustered in a matter of minutes in a modern GPU, and seconds in a small scale cluster of multi-core CPUs, or multiple GPUs. Finally, we discuss how user selected parameters, as well as algorithmic and implementation choices, affect performance.
  • Keywords
    computational complexity; distributed memory systems; graphics processing units; multi-threading; parallel architectures; pattern clustering; QTC; clustering algorithms; distributed memory implementations; large data-sets clustering; multicore CPU; multiple GPU; parallel architectures; partitioning data; quality threshold clustering; running time; scientific discovery; small scale cluster; thread-parallel; user selected parameters; worst case complexity; Algorithm design and analysis; Clustering algorithms; Complexity theory; Graphics processing unit; Memory management; Proteins; Upper bound; GPU; QT-clustering; complexity; distributed; multi-core;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-0975-2
  • Type

    conf

  • DOI
    10.1109/IPDPS.2012.99
  • Filename
    6267912