• DocumentCode
    2866363
  • Title

    Performance Evaluation of OpenMP Constructs and Kernel Benchmarks on a Loongson-3A Quad-Core SMP System

  • Author

    Luo, Qiuming ; Kong, Chang ; Cai, Ye ; Liu, Gang

  • Author_Institution
    Nat. High Performance Comput. Center (NHPCC), Shenzhen Univ., Shenzhen, China
  • fYear
    2011
  • fDate
    20-22 Oct. 2011
  • Firstpage
    191
  • Lastpage
    196
  • Abstract
    As a competitor and alternative to mainstream general-purpose CPU (Intel/AMD/etc.), Loongson is a family of general-purpose MIPS-compatible CPUs developed at the ICT of CAS in China. The quad-core Loongson 3A is evaluated in this paper. The performance of the basic OpenMP constructs on Loongson-3A quad-core SMP is obtained by applying the EPCC Micro benchmarks. And then the performance of NAS kernel codes is obtained by applying NAS Parallel Benchmarks (NPB). These benchmarking are carried out for three different OpenMP compilers (and the runtime system), which includes GCC, OMPipth (OMPi with pthread library) and OMPi-psth (OMPi with psthread library). The results show that OMPI-pth´s performance is the best and OMPi-psth´s performance is the worst. Those test results might help to program the OpenMP codes as well as to select the appropriate compiler and its runtime system. And an Intel core i5 quad-core platform is used for comparison purpose, by running NPB, which implies that Loongson 3A´s performance is nearly one tenth of i5´s. The NPB results can help to defining a Loongson system´s scale when replacing an Intel i5 system for a given problem size.
  • Keywords
    benchmark testing; multiprocessing systems; open systems; parallel processing; performance evaluation; program compilers; shared memory systems; CAS; EPCC microbenchmark; GCC; ICT; Loongson-3A quadcore SMP system; NAS kernel code; NAS parallel benchmark; OMPi-psth; OMPi-pth; OpenMP code; compiler system; general purpose MlPS-compatible CPU; kernel benchmark; mainstream general purpose CPU; performance evaluation; Benchmark testing; Dynamic scheduling; Kernel; Libraries; Processor scheduling; Runtime; Switches; Loongson; Microbenchmarks; NPB; OpenMP; Performance Evaluation; SMP;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2011 12th International Conference on
  • Conference_Location
    Gwangju
  • Print_ISBN
    978-1-4577-1807-6
  • Type

    conf

  • DOI
    10.1109/PDCAT.2011.66
  • Filename
    6118935