DocumentCode :
2996085
Title :
Deterministic Timing-Driven Parallel Placement by Simulated Annealing Using Half-Box Window Decomposition
Author :
Goeders, Jeffrey B. ; Lemieux, Guy G F ; Wilton, Steven J E
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of British Columbia, Vancouver, BC, Canada
fYear :
2011
fDate :
Nov. 30 2011-Dec. 2 2011
Firstpage :
41
Lastpage :
48
Abstract :
As each generation of FPGAs grow in size, the run time of the associated CAD tools is rapidly increasing. Many past efforts have aimed at improving the CAD run time through parallelization of the placement algorithm. Wang and Lemieux presented an algorithm that is scalable, deterministic, timing-driven and achieves speedup over VPR [Wang and Lemieux FPGA´11]. This paper provides two significant alterations to Wang and Lemieux´s algorithm, resulting in additional speedup and quality improvement. The first contribution is a new data decomposition scheme, called the half-box window technique, which achieves speedup by reducing the frequency of thread synchronization. The second contribution is the development of an improved annealing schedule, which further improves run time and slightly improves the quality of results. Together, these modifications achieve run time speedups of up to 70%. To put this in perspective, Wang and Lemieux required 25 threads to achieve best speedup, while this work requires only 16 threads. For a 10% degradation in quality, the new 16-thread algorithm achieves a 51x speedup over VPR, compared to a 35x speedup by the 25-thread original algorithm. Regarding quality, the best quality of results achieved by the new algorithm is a 5% degradation versus VPR, compared to a 8% degradation of the original Wang and Lemieux algorithm.
Keywords :
field programmable gate arrays; logic CAD; multi-threading; parallel processing; simulated annealing; 16-thread algorithm; CAD tools; FPGA; Lemieux algorithm; Wang algorithm; annealing schedule; data decomposition scheme; deterministic timing-driven parallel placement; half-box window decomposition; placement algorithm parallelization; quality improvement; simulated annealing; thread synchronization; Annealing; Degradation; Design automation; Field programmable gate arrays; Program processors; Schedules; Synchronization; CAD; FPGA; parallel placement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reconfigurable Computing and FPGAs (ReConFig), 2011 International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4577-1734-5
Type :
conf
DOI :
10.1109/ReConFig.2011.27
Filename :
6128552
Link To Document :
بازگشت