DocumentCode :
3286331
Title :
Genetic algorithms in software and in hardware-a performance analysis of workstation and custom computing machine implementations
Author :
Graham, Paul ; Nelson, Brent
Author_Institution :
Dept. of Electr. & Comput. Eng., Brigham Young Univ., Provo, UT, USA
fYear :
1996
fDate :
17-19 Apr 1996
Firstpage :
216
Lastpage :
225
Abstract :
The paper analyzes the performance differences found between the hardware and software versions of a genetic algorithm used to solve the travelling salesman problem. The hardware implementation requires 4 FPGA´s on a Splash 2 board and runs at 11 MHz. The software implementation was written in C++ and executed on a 125 MHz HP PA-RISC workstation. The software run time was more than four times that of the hardware (up to 50 times as many cycles). The paper analyses the contribution made to this performance difference by the following hardware features: hard-wired control, custom address generation logic, memory hierarchy efficiency, and both fine- and course-grained parallelism. The results indicate that the major contributor to the hardware performance advantage is fine-grained parallelism-RTL-level parallelism due to operator pipelining. This alone accounts for as much as a 38X cycle-count reduction over the software in one section of the algorithm. The next major contributors include hard-wired control and custom address generation which account for as much as a 3X speedup in other sections of the algorithm. Finally, memory hierarchy inefficiencies in the software (cache misses and paging) and coarse-grained parallelism in the hardware are each shown to have lesser effect on the performance difference between the implementations
Keywords :
field programmable gate arrays; genetic algorithms; parallel architectures; performance evaluation; pipeline processing; reduced instruction set computing; software performance evaluation; special purpose computers; travelling salesman problems; virtual storage; workstations; 11 MHz; 125 MHz; C++ software implementation; FPGA; HP PA-RISC workstation; RTL-level parallelism; Splash 2 board; cache misses; course-grained parallelism; custom address generation logic; custom computing machine implementations; cycle-count reduction; fine-grained parallelism; genetic algorithms; hard-wired control; hardware; memory hierarchy efficiency; memory hierarchy inefficiencies; operator pipelining; performance analysis; performance difference; software; software run time; travelling salesman problem; Genetic algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
FPGAs for Custom Computing Machines, 1996. Proceedings. IEEE Symposium on
Conference_Location :
Napa Valley, CA
Print_ISBN :
0-8186-7548-9
Type :
conf
DOI :
10.1109/FPGA.1996.564847
Filename :
564847
Link To Document :
بازگشت