Title :
Efficient Acceleration of Mutual Information Computation for Nonrigid Registration Using CUDA
Author :
Ikeda, Ken-ichi ; Ino, Fumihiko ; Hagihara, Kohei
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
Abstract :
In this paper, we propose an efficient acceleration method for the nonrigid registration of multimodal images that uses a graphics processing unit. The key contribution of our method is efficient utilization of on-chip memory for both normalized mutual information (NMI) computation and hierarchical B-spline deformation, which compose a well-known registration algorithm. We implement this registration algorithm as a compute unified device architecture program with an efficient parallel scheme and several optimization techniques such as hierarchical data organization, data reuse, and multiresolution representation. We experimentally evaluate our method with four clinical datasets consisting of up to 512 × 512 × 296 voxels. We find that exploitation of on-chip memory achieves a 12-fold increase in speed over an off-chip memory version and, therefore, it increases the efficiency of parallel execution from 4% to 46%. We also find that our method running on a GeForce GTX 580 card is approximately 14 times faster than a fully optimized CPU-based implementation running on four cores. Some multimodal registration results are also provided to understand the limitation of our method. We believe that our highly efficient method, which completes an alignment task within a few tens of seconds, will be useful to realize rapid nonrigid registration.
Keywords :
digital storage; graphics processing units; image registration; information theory; medical image processing; parallel architectures; splines (mathematics); CUDA; GeForce GTX 580 card; NMI computation; alignment task; clinical datasets; compute unified device architecture program; data reuse; efficient mutual information computation acceleration; efficient on-chip memory utilization; fully optimized CPU-based implementation; graphics processing unit; hierarchical B-spline deformation; hierarchical data organization; multiresolution representation; nonrigid multimodal image registration; normalized mutual information computation; off-chip memory; optimization techniques; parallel execution efficiency; rapid nonrigid registration; registration algorithm; Acceleration; Graphics processing units; Histograms; Instruction sets; Joints; Memory management; Splines (mathematics); Acceleration; compute unified device architecture (CUDA); graphics processing unit (GPU); mutual information (MI); nonrigid registration;
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
DOI :
10.1109/JBHI.2014.2310745