• DocumentCode
    688359
  • Title

    Performance Pattern of Unified Parallel C on Multi-core Clusters

  • Author

    Hamid, Nor Asilah Wati Abdul ; Serres, Olivier ; Anbar, Ahmad ; Hassan, Shoaib

  • Author_Institution
    Dept. of Electr. & Comput. Eng., George Washington Univ., Washington, DC, USA
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    1751
  • Lastpage
    1757
  • Abstract
    The Partitioned Global Address Space (PGAS) model has been widely used in multi-core clusters as an alternative to MPI. Among the widespread use is Unified Parallel C (UPC). Previous research has shown that UPC performance is comparable with MPI, however in certain cases UPC require hand-tuning techniques such as prefetching and privatized pointers-to-shared to improve the performance. In this paper we reviews, evaluate and analyze the performance pattern between UPC Naïve, UPC optimize and MPI on two different multi-core clusters architecture. We focus our study using matrix multiplication as the benchmark and perform our experimental on two distributed memory machine, Cray XE6 with Gemini interconnects and Sun Cluster with Infiniband interconnects. We provide analysis on each core execution time to understand the pattern of communication for both machines. We also demonstrate the gaps between naïve and optimized are depends on the compiler with its associate distributed memory machine. We also observed unnecessary optimization for certain programs related to HPC architecture and compiler.
  • Keywords
    message passing; multiprocessing systems; parallel processing; performance evaluation; program compilers; Cray XE6; Gemini interconnects; HPC architecture; Infiniband interconnects; MPI; PGAS model; Sun Cluster; UPC; compiler; core execution time; distributed memory machine; high performance computing; message passing interface; multicore clusters architecture; partitioned global address space; prefetching technique; privatized pointers-to-shared technique; unified parallel C; Benchmark testing; Computer architecture; Electronics packaging; Optimization; Program processors; Random access memory; Sun; Gemini; MPI; PGAS; Performance Evaluation; UPC;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.250
  • Filename
    6832132