مرکز منطقه ای اطلاع رساني علوم و فناوري - Genetic algorithms in software and in hardware-a performance analysis of workstation and custom computing machine implementations

DocumentCode :

3286331

Title :

Genetic algorithms in software and in hardware-a performance analysis of workstation and custom computing machine implementations

Author :

Graham, Paul ; Nelson, Brent

Author_Institution :

Dept. of Electr. & Comput. Eng., Brigham Young Univ., Provo, UT, USA

fYear :

1996

fDate :

17-19 Apr 1996

Firstpage :

216

Lastpage :

225

Abstract :

The paper analyzes the performance differences found between the hardware and software versions of a genetic algorithm used to solve the travelling salesman problem. The hardware implementation requires 4 FPGA´s on a Splash 2 board and runs at 11 MHz. The software implementation was written in C++ and executed on a 125 MHz HP PA-RISC workstation. The software run time was more than four times that of the hardware (up to 50 times as many cycles). The paper analyses the contribution made to this performance difference by the following hardware features: hard-wired control, custom address generation logic, memory hierarchy efficiency, and both fine- and course-grained parallelism. The results indicate that the major contributor to the hardware performance advantage is fine-grained parallelism-RTL-level parallelism due to operator pipelining. This alone accounts for as much as a 38X cycle-count reduction over the software in one section of the algorithm. The next major contributors include hard-wired control and custom address generation which account for as much as a 3X speedup in other sections of the algorithm. Finally, memory hierarchy inefficiencies in the software (cache misses and paging) and coarse-grained parallelism in the hardware are each shown to have lesser effect on the performance difference between the implementations

Keywords :

field programmable gate arrays; genetic algorithms; parallel architectures; performance evaluation; pipeline processing; reduced instruction set computing; software performance evaluation; special purpose computers; travelling salesman problems; virtual storage; workstations; 11 MHz; 125 MHz; C++ software implementation; FPGA; HP PA-RISC workstation; RTL-level parallelism; Splash 2 board; cache misses; course-grained parallelism; custom address generation logic; custom computing machine implementations; cycle-count reduction; fine-grained parallelism; genetic algorithms; hard-wired control; hardware; memory hierarchy efficiency; memory hierarchy inefficiencies; operator pipelining; performance analysis; performance difference; software; software run time; travelling salesman problem; Genetic algorithms;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

FPGAs for Custom Computing Machines, 1996. Proceedings. IEEE Symposium on

Conference_Location :

Napa Valley, CA

Print_ISBN :

0-8186-7548-9

Type :

conf

DOI :

10.1109/FPGA.1996.564847

Filename :

564847

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3286331