Title :
Acceleration of variance of color differences-based demosaicing using CUDA
Author :
Faruqi, Muhammad Ismail ; Ino, Fumihiko ; Hagihara, Kenichi
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
Abstract :
Image demosaicing algorithms are used to reconstruct a full color image from the incomplete color samples output (RAW data) of an image sensor overlaid with a Color Filter Array (CFA). Better demosaicing algorithms are superior in terms of acuity, dynamic range, signal to noise ratio, and artifact suppression, which make them suitable for high quality delivery such as theatrical broadcast. In this paper, we present our efforts in examining the feasibility of exploiting the Graphics Processing Unit (GPU) as an emerging accelerator to create an on-the-fly implementation of Variance of Color Differences (VCD) demosaicing, a state-of-the-art heuristic demosaicing algorithm developed to eliminate false-color artifacts in texture region of images. Our efforts in this paper are 1) implementing the algorithm as several kernels to separate the bottleneck portion of the algorithm from the rest and to minimize idle threads and 2) reducing I/O between shared and global memory when performing green channel interpolation by separating the input RAW data into four channels. We then compare the implementation featuring both acceleration methods with a single kernel implementation. Based on experimental results, our proposed acceleration methods achieved per-frame processing time of 343 ms on an nVidia GTX 480, which translates into 2.95 fps. Additionally, our proposed methods were also able to accelerate the kernel time and the effective memory bandwidth by a factor of 2.1× compared with its single kernel counterpart.
Keywords :
filtering theory; graphics processing units; image colour analysis; image segmentation; image sensors; image texture; parallel architectures; CUDA; GPU; I/O reduction; acceleration method; artifact suppression; bottleneck portion separation; color differences-based demosaicing; color filter array; false-color artifact elimination; full color image reconstruction; global memory; graphics processing unit; green channel interpolation; heuristic demosaicing algorithm; idle thread minimisation; image demosaicing algorithm; image sensor; nVidia GTX 480; shared memory; signal to noise ratio; single kernel implementation; texture region; variance acceleration; variance of color differences demosaicing; Acceleration; Color; Graphics processing unit; Image color analysis; Instruction sets; Interpolation; Kernel; CUDA; GPU; Parallel processing; image demosaicing;
Conference_Titel :
High Performance Computing and Simulation (HPCS), 2012 International Conference on
Conference_Location :
Madrid
Print_ISBN :
978-1-4673-2359-8
DOI :
10.1109/HPCSim.2012.6266965