Title :
The effects of datapath placement and C-slow retiming on three computational benchmarks
Author :
Weaver, Nicholas ; Wawrzynek, John
Author_Institution :
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
Abstract :
Summary form only given. Two important optimizations within the FPGA design process, C-slow retiming and datapath placement, offer significant benefits for designers. Many have advocated and implemented tools to use these techniques in both automatic and semiautomatic manner but they have not made their way into conventional FPGA toolflows. C-slow retiming is a method of accelerating computations that include feedback loops. Instead of having a single instance of the computation, the feedback loop is pipelined so that C separate instances are all calculated simultaneously. This allows fine grained pipelining to occur even in designs that include feedback loops, such as single round cryptographic implementations or microprocessors. Done properly, it imposes a significant but not imposing latency penalty for single computations while offering huge increases in throughput. Datapath placement is simply constructing the design in a manner that accounts for the higher level data flows. This offers several benefits, including improved performance, more physically compact designs, shorter wires, and faster place and route times when the FPGA is heavily utilized. Even for designs with less structure which are amenable to simulated annealing, datapath placement may still offer a significant benefit. To clearly demonstrate the importance of these optimizations we have hand-modified three computational benchmarks which represent significant themes within FPGA computation: Rijndael/AES encryption, Smith/Waterman, and a simplified 32-bit microprocessor datapath. All three represent significantly different modes of computation within FPGAs, but all gain significantly from the use of these techniques.
Keywords :
circuit layout CAD; field programmable gate arrays; logic CAD; performance evaluation; C-slow retiming; FPGA design; datapath placement; optimizations; performance; pipelining; placement; Acceleration; Cryptography; Delay; Design optimization; Feedback loop; Field programmable gate arrays; Microprocessors; Pipeline processing; Process design; Throughput;
Conference_Titel :
Field-Programmable Custom Computing Machines, 2002. Proceedings. 10th Annual IEEE Symposium on
Print_ISBN :
0-7695-1801-X
DOI :
10.1109/FPGA.2002.1106694