Title :
Efficient Hough Transform on the FPGA using DSP Slices and Block RAMs
Author :
Xin Zhou ; Tomagou, Norihiro ; Ito, Yu ; Nakano, Kaoru
Author_Institution :
Dept. of Inf. Eng., Hiroshima Univ., Higashi-Hiroshima, Japan
Abstract :
The main contribution of this paper is to present a new FPGA architecture for the Hough transform that identifies straight lines in a binary image. Recent FPGAs have hundreds of embedded DSP slices and block RAMs. For example, Xilinx Virtex-6 Family FPGAs have a DSP48E1 slice, which is a configurable logic block equipped with fast multipliers, adders, pipeline registers, and so on. They also have a dual-port memory with 18Kbits as a block RAM. One of the most important key techniques for accelerating computation using FPGAs is an efficient usage ofDSP slices and block RAMs. Our new architecture for the Hough transform uses 178 DSP48E1 slices and 180 block RAMs with 18Kbits that work in parallel. As far as we know, there is no previously published work that fully utilizes DSP slices and block RAMs for the Hough transform. Roughly speaking, a conventional sequential implementation performs 180m voting operations for m edge points. Our architecture performs voting operations in parallel, and outputs identified straight lines in m+97 clock cycles. Since 180m voting operations are performed using 178 DSP48E1 slices, the lower bound of the computing time is m clock cycles. Hence our implementation is close to optimal. The implementation results show that the Hough transform for a 512×512 image with 33232 edge points can be done in only 135.75us.
Keywords :
Hough transforms; digital signal processing chips; field programmable gate arrays; image processing; parallel architectures; random-access storage; DSP48E1 slices; FPGA architecture; Hough transform; binary image; block RAM; clock cycles; edge points; embedded DSP slices; sequential implementation; voting operations; Clocks; Computer architecture; Digital signal processing; Field programmable gate arrays; Ports (Computers); Random access memory; Transforms; Embedded DSP slices; Embedded block RAMs; FPGA; Hough transform; Image processing; Line detection;
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
DOI :
10.1109/IPDPSW.2013.86