• DocumentCode
    2050578
  • Title

    Quartile and Outlier Detection on Heterogeneous Clusters Using Distributed Radix Sort

  • Author

    Spafford, Kyle L. ; Meredith, Jeremy S. ; Vetter, Jeffrey S.

  • Author_Institution
    Future Technol. Group, Oak Ridge Nat. Lab., Oak Ridge, TN, USA
  • fYear
    2011
  • fDate
    26-30 Sept. 2011
  • Firstpage
    412
  • Lastpage
    419
  • Abstract
    In the past few years, performance improvements in CPUs and memory technologies have outpaced those of storage systems. When extrapolated to the exascale, this trend places strict limits on the amount of data that can be written to disk for full analysis, resulting in an increased reliance on characterizing in-memory data. Many of these characterizations are simple, but require sorted data. This paper explores an example of this type of characterization -- the identification of quartiles and statistical outliers -- and presents a performance analysis of a distributed heterogeneous radix sort as well as an assessment of current architectural bottlenecks.
  • Keywords
    extrapolation; sorting; statistical analysis; storage management; distributed radix sort; heterogeneous cluster; outlier detection; quartile detection; statistical outliers; storage system; Algorithm design and analysis; Bandwidth; Graphics processing unit; Histograms; Indexing; Runtime; Sorting; GPUs; performance analysis; sorting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing (CLUSTER), 2011 IEEE International Conference on
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-4577-1355-2
  • Electronic_ISBN
    978-0-7695-4516-5
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2011.53
  • Filename
    6061072