• DocumentCode
    1688665
  • Title
    Outlier detection in performance data of parallel applications
  • Author
    Benkert, Katharina ; Gabriel, Edgar ; Resch, Michael M.
  • Author_Institution
    Dept. of Comput. Sci., Univ. of Houston, Houston, TX
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    When an adaptive software component is employed to select the best-performing implementation for a communication operation at runtime, the correctness of the decision taken strongly depends on detecting and removing outliers in the data used for the comparison. This automatic decision is greatly complicated by the fact that the types and quantities of outliers depend on the network interconnect and the nodes assigned to the job by the batch scheduler. This paper evaluates four different statistical methods used for handling outliers, namely a standard interquartile range method, a heuristic derived from the trimmed mean value, cluster analysis, and a method using robust statistics. Using performance data from the Abstract Data and Communication Library (ADCL), we evaluate the correctness of the decisions made with each statistical approach over three fundamentally different network interconnects, namely a highly reliable InfiniBand network, a gigabit Ethernet network having a larger variance in the performance, and a hierarchical gigabit Ethernet network.
  • Keywords
    multiprocessor interconnection networks; parallel processing; scheduling; statistical analysis; workstation clusters; InfiniBand network; adaptive software component; batch scheduler; cluster analysis; gigabit Ethernet network; network interconnect; outlier detection; parallel application; robust statistics; standard interquartile range; statistical method; Application software; Context; Ethernet networks; High performance computing; Runtime; Software libraries; Software performance; Statistical analysis; Switches; Telecommunication network reliability; adaptive communication libraries; outlier detection; performance analysis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
  • Conference_Location
    Miami, FL
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-1693-6
  • Electronic_ISBN
    1530-2075
  • Type
    conf
  • DOI
    10.1109/IPDPS.2008.4536463
  • Filename
    4536463