• DocumentCode
    3286331
  • Title

    Genetic algorithms in software and in hardware-a performance analysis of workstation and custom computing machine implementations

  • Author

    Graham, Paul ; Nelson, Brent

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Brigham Young Univ., Provo, UT, USA
  • fYear
    1996
  • fDate
    17-19 Apr 1996
  • Firstpage
    216
  • Lastpage
    225
  • Abstract
    The paper analyzes the performance differences found between the hardware and software versions of a genetic algorithm used to solve the travelling salesman problem. The hardware implementation requires 4 FPGA´s on a Splash 2 board and runs at 11 MHz. The software implementation was written in C++ and executed on a 125 MHz HP PA-RISC workstation. The software run time was more than four times that of the hardware (up to 50 times as many cycles). The paper analyses the contribution made to this performance difference by the following hardware features: hard-wired control, custom address generation logic, memory hierarchy efficiency, and both fine- and course-grained parallelism. The results indicate that the major contributor to the hardware performance advantage is fine-grained parallelism-RTL-level parallelism due to operator pipelining. This alone accounts for as much as a 38X cycle-count reduction over the software in one section of the algorithm. The next major contributors include hard-wired control and custom address generation which account for as much as a 3X speedup in other sections of the algorithm. Finally, memory hierarchy inefficiencies in the software (cache misses and paging) and coarse-grained parallelism in the hardware are each shown to have lesser effect on the performance difference between the implementations
  • Keywords
    field programmable gate arrays; genetic algorithms; parallel architectures; performance evaluation; pipeline processing; reduced instruction set computing; software performance evaluation; special purpose computers; travelling salesman problems; virtual storage; workstations; 11 MHz; 125 MHz; C++ software implementation; FPGA; HP PA-RISC workstation; RTL-level parallelism; Splash 2 board; cache misses; course-grained parallelism; custom address generation logic; custom computing machine implementations; cycle-count reduction; fine-grained parallelism; genetic algorithms; hard-wired control; hardware; memory hierarchy efficiency; memory hierarchy inefficiencies; operator pipelining; performance analysis; performance difference; software; software run time; travelling salesman problem; Genetic algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    FPGAs for Custom Computing Machines, 1996. Proceedings. IEEE Symposium on
  • Conference_Location
    Napa Valley, CA
  • Print_ISBN
    0-8186-7548-9
  • Type

    conf

  • DOI
    10.1109/FPGA.1996.564847
  • Filename
    564847