• DocumentCode
    1971236
  • Title

    Numerical applications and sub-word parallelism: the NAS benchmarks on a Pentium 4

  • Author

    Daniel, Etiemble

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Toronto Univ., Ont., Canada
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    205
  • Lastpage
    212
  • Abstract
    We examine the impact of Pentium 4 SIMD instructions on the Fortran and C versions of the NAS benchmarks, either by compiler vectorization or by assembly code in-lining. If few functions generally profit from the SIMD operations, the ones using complex numbers or random number generators can be efficiently accelerated.
  • Keywords
    microprocessor chips; parallel processing; performance evaluation; program assemblers; program compilers; NAS benchmarks; Pentium 4 SIMD instructions; assembly code in-lining; compiler vectorization; random number generators; sub-word parallelism; Acceleration; Assembly; Benchmark testing; Costs; Microprocessors; Parallel machines; Parallel processing; Personal communication networks; Random number generation; Registers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing Systems and Applications, 2002. Proceedings. 16th Annual International Symposium on
  • Print_ISBN
    0-7695-1626-2
  • Type

    conf

  • DOI
    10.1109/HPCSA.2002.1019156
  • Filename
    1019156