• DocumentCode
    3385027
  • Title

    Parallel Implementation of 2D-DWT by Purging Read after Write Dependency for High Speed Applications

  • Author

    Ashraf, M. ; Baig, Mirza Sami ; Khan, L.A. ; Hassan, Asif

  • Author_Institution
    Center for Cyber Technol. & Spectrum Manage., Nat. Univ. of Sci. & Technol., Islamabad
  • fYear
    2007
  • fDate
    12-13 Nov. 2007
  • Firstpage
    263
  • Lastpage
    268
  • Abstract
    This paper proposes an efficient implementation of multistage, multiple-level DSP algorithms suitable for parallel and distributed processing. To describe our method we selected Mallat\´s algorithm for two dimensional wavelet transforms (2D-DWT) coefficient computation which has multistage and multilevel processing requirements. We have selected field programmable gate arrays (FPGA) as a processing unit because of its inherited parallel processing capabilities but our method is not limited to FPGAs only. Our method directly computes 2D-DWT coefficients without computing and storing intermediate results; which makes it faster; resource saving and removes read after Write (RAW) dependencies. We discuss multistage and single level implementation but ideally it can be extended to n-level implementation. We also proposed method for generation of "mutually scaled filter coefficients (MSFC)" and computation of maximum number of parallel processors for optimized performance in this particular case. Both lookup tables (LUTs) and multipliers along with addition/subtraction architecture can be used. However, LUTs have advantage of high processing speed. Two computational stages are combined into a single stage to remove read after write (RAW) dependency. Our implementation takes N2/4+L2/4-2 time units to compute 2D- DWT of N x N input data, with filter length of L without intermediate storage. This method can be used for other multistage, multilevel DSP problems. Quartusreg II IDE and Altera Stratix device is used for implementation.
  • Keywords
    discrete wavelet transforms; field programmable gate arrays; image processing; parallel processing; table lookup; 2D-DWT; field programmable gate arrays; lookup tables; multistage multiple-level DSP algorithms; mutually scaled filter coefficients; parallel processing; read after write dependencies; two dimensional wavelet transforms; Computer architecture; Concurrent computing; Digital signal processing; Distributed processing; Field programmable gate arrays; Filters; Optimization methods; Parallel processing; Table lookup; Wavelet transforms; Concurrent Readable Memory; Image processing; Parallel Processing; Pipelining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging Technologies, 2007. ICET 2007. International Conference on
  • Conference_Location
    Islamabad
  • Print_ISBN
    978-1-4244-1493-2
  • Electronic_ISBN
    978-1-4244-1494-9
  • Type

    conf

  • DOI
    10.1109/ICET.2007.4516355
  • Filename
    4516355