• DocumentCode
    3089225
  • Title

    Optimizing Option Pricing Algorithms and Profiling Power Consumption on VLIW APU Architecture

  • Author

    Doerksen, Matthew ; Thulasiraman, Parimala ; Thulasiram, Ruppa K.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Manitoba, Winnipeg, MB, Canada
  • fYear
    2012
  • fDate
    10-13 July 2012
  • Firstpage
    71
  • Lastpage
    78
  • Abstract
    Heterogeneous multi-core architectures have become an integral component of high performance systems and high performance scientific computing (HPC). The use of these systems has been vital for research applications but until recently have not been a factor in the consumer level experience. However, with new technologies such as AMD´s Accelerated Processing Unit (APU) which combines the Central Processing Unit and Graphics Processing Unit onto a single die, consumers now have an affordable high performance system at their disposal. AMD´s APUs are aimed at providing good performance and low power consumption for all markets. Financial applications can benefit from this heterogeneous architecture for real time processing. However, to obtain good performance, algorithms must be coded to efficiently utilize the APU architecture. In this paper, we have optimized two option pricing algorithms on the APU making use of vectorization and loop unrolling for improved performance. Our algorithms are tested on both an ATI Mobility Radeon 5870 and an AMD E-350 APU which use the VLIW5 architecture. We also study the power consumption of these architectures to determine how they compare to traditional CPU- and GPU- based systems.
  • Keywords
    graphics processing units; multiprocessing systems; parallel architectures; pricing; real-time systems; AMD E-350 APU; AMD accelerated processing unit; ATI mobility Radeon 5870; HPC; VLIW APU architecture; VLIW5 architecture; central processing unit; consumer level experience; graphics processing unit; heterogeneous multicore architectures; high performance scientific computing; integral component; loop unrolling; low power consumption; option pricing algorithm optimization; profiling power consumption; realtime processing; vectorization; Computer architecture; Europe; Graphics processing unit; Instruction sets; Lattices; Power demand; Pricing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on
  • Conference_Location
    Leganes
  • Print_ISBN
    978-1-4673-1631-6
  • Type

    conf

  • DOI
    10.1109/ISPA.2012.18
  • Filename
    6280277