Title :
Efficient template matching with variable size templates in CUDA
Author :
Moore, Nicholas ; Leeser, Miriam ; King, Laurie Smith
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
Abstract :
Graphics processing units (GPUs) offer significantly higher peak performance than CPUs, but for a limited problem space. Even within this space, GPU solutions are often restricted to a set of specific problem instances or offer greatly varying performance for slightly different parameters. This makes providing a library of GPU implementations that is adaptable to arbitrary inputs a difficult task. This research is motivated by a MATLAB lung tumor tracking application that relies on two-dimensional correlation and uses large template sizes. While GPU-based template matching has been addressed in the past, template sizes were limited to specific, relatively small sizes and not acceptable for accelerating the target application. This paper discusses a CUDA implementation that supports large template sizes and is adaptable to arbitrary template dimensions. The implementation uses on-demand compilation of kernels and compile-time expansion of various kernel parameters to improve the implementation adaptability without sacrificing performance.
Keywords :
computer graphic equipment; coprocessors; image matching; lung; mathematics computing; medical computing; parallel processing; tumours; CUDA; MATLAB lung tumor tracking application; efficient template matching; graphic processing units; kernels; on demand compilation; Acceleration; Computer science; Educational institutions; Kernel; Libraries; Lung neoplasms; MATLAB; Mathematics; Target tracking; Yarn;
Conference_Titel :
Application Specific Processors (SASP), 2010 IEEE 8th Symposium on
Conference_Location :
Anaheim, CA
Print_ISBN :
978-1-4244-7953-5
DOI :
10.1109/SASP.2010.5521142