• DocumentCode
    1996135
  • Title

    Efficient Hough Transform on the FPGA using DSP Slices and Block RAMs

  • Author

    Xin Zhou ; Tomagou, Norihiro ; Ito, Yu ; Nakano, Kaoru

  • Author_Institution
    Dept. of Inf. Eng., Hiroshima Univ., Higashi-Hiroshima, Japan
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    771
  • Lastpage
    778
  • Abstract
    The main contribution of this paper is to present a new FPGA architecture for the Hough transform that identifies straight lines in a binary image. Recent FPGAs have hundreds of embedded DSP slices and block RAMs. For example, Xilinx Virtex-6 Family FPGAs have a DSP48E1 slice, which is a configurable logic block equipped with fast multipliers, adders, pipeline registers, and so on. They also have a dual-port memory with 18Kbits as a block RAM. One of the most important key techniques for accelerating computation using FPGAs is an efficient usage ofDSP slices and block RAMs. Our new architecture for the Hough transform uses 178 DSP48E1 slices and 180 block RAMs with 18Kbits that work in parallel. As far as we know, there is no previously published work that fully utilizes DSP slices and block RAMs for the Hough transform. Roughly speaking, a conventional sequential implementation performs 180m voting operations for m edge points. Our architecture performs voting operations in parallel, and outputs identified straight lines in m+97 clock cycles. Since 180m voting operations are performed using 178 DSP48E1 slices, the lower bound of the computing time is m clock cycles. Hence our implementation is close to optimal. The implementation results show that the Hough transform for a 512×512 image with 33232 edge points can be done in only 135.75us.
  • Keywords
    Hough transforms; digital signal processing chips; field programmable gate arrays; image processing; parallel architectures; random-access storage; DSP48E1 slices; FPGA architecture; Hough transform; binary image; block RAM; clock cycles; edge points; embedded DSP slices; sequential implementation; voting operations; Clocks; Computer architecture; Digital signal processing; Field programmable gate arrays; Ports (Computers); Random access memory; Transforms; Embedded DSP slices; Embedded block RAMs; FPGA; Hough transform; Image processing; Line detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.86
  • Filename
    6650954