Title :
High-speed parallel implementation of a modified PBR algorithm on DSP-based EH topology
Author :
Rajan, K. ; Patnaik, L.M. ; Ramakrishna, J.
Author_Institution :
Dept. of Phys., Indian Inst. of Sci., Bangalore, India
fDate :
8/1/1997 12:00:00 AM
Abstract :
Algebraic Reconstruction Technique (ART) is an age-old method used for solving the problem of three-dimensional (3-D) reconstruction from projections in electron microscopy and radiology. In medical applications, direct 3-D reconstruction is at the forefront of investigation. The simultaneous iterative reconstruction technique (SIRT) is an ART-type algorithm with the potential of generating in a few iterations tomographic images of a quality comparable to that of convolution backprojection (CBP) methods. Pixel-based reconstruction (PBR) is similar to SIRT reconstruction, and it has been shown that PBR algorithms give better quality pictures compared to those produced by SIRT algorithms. In this work, we propose a few modifications to the PBR algorithms. The modified algorithms are shown to give better quality pictures compared to PBR algorithms. The PBR algorithm and the modified PBR algorithms are highly compute intensive, Not many attempts have been made to reconstruct objects in the true 3-D sense because of the high computational overhead. In this study, we have developed parallel two-dimensional (2-D) and 3-D reconstruction algorithms based on modified PBR. We attempt to solve the two problems encountered by the PBR and modified PBR algorithms, i.e., the long computational time and the large memory requirements, by parallelizing the algorithm on a multiprocessor system. We investigate the possible task and data partitioning schemes by exploiting the potential parallelism in the PBR algorithm subject to minimizing the memory requirement. We have implemented an extended hypercube (EH) architecture for the high-speed execution of the 3-D reconstruction algorithm using the commercially available fast floating point digital signal processor (DSP) chips as the processing elements (PEs) and dual-port random access memories (DPR) as channels between the PEs. We discuss and compare the performances of the PBR algorithm on an IBM 6000 RISC workstation, on a Silicon Graphics Indigo 2 workstation, and on an EH system. The results show that an EH(3,1) using DSP chips as PEs executes the modified PBR algorithm about 100 times faster than an LBM 6000 RISC workstation. We have executed the algorithms on a 4-node IBM SP2 parallel computer. The results show that execution time of the algorithm on an EH(3,1) is better than that of a 4-node IBM SP2 system. The speed-up of an EH(3,1) system with eight PEs and one network controller is approximately 7.85
Keywords :
computerised tomography; diagnostic radiography; hypercube networks; image reconstruction; medical image processing; multiprocessing programs; parallel algorithms; DSP-based EH topology; IBM 6000 RISC workstation; algebraic reconstruction technique; computational time; convolution backprojection; data partitioning; dual-port random access memories; electron microscopy; extended hypercube architecture; fast floating point digital signal processor; high-speed parallel implementation; medical applications; memory requirements; modified PBR algorithm; multiprocessor system; pixel-based reconstruction; radiology; simultaneous iterative reconstruction technique; tomographic images; Bioreactors; Concurrent computing; Digital signal processing chips; Image reconstruction; Iterative algorithms; Partitioning algorithms; Reduced instruction set computing; Signal processing algorithms; Three dimensional displays; Workstations;
Journal_Title :
Nuclear Science, IEEE Transactions on