Title :
Implementation of accelerated BCH decoders on GPU
Author :
Xiaoxia Qi ; Xiao Ma ; Dou Li ; Yuping Zhao
Author_Institution :
Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing, China
Abstract :
With the development of communication systems, the requirement for efficient error correcting code becomes an important issue. In this paper, we address a parallel software implementation of BCH(Bose-Chaudhuri-Hocquenghen) decoding for HINOC (High Performance Network Over Coax) standard, which is an access network technology aiming at solving the bandwidth limit in user area. A highly efficient parallel decoding algorithm of BCH codes based on CU-DA (Compute Unified Device Architecture) is presented. An iterative decoding algorithm is adopted to implement BCH decoders by virtue of the massively parallel architecture advantage of GPU (Graphic Process Unit). The major concern of the BCH decoders devised on GPU is concentrated on the parallel process capability of the decoding algorithm. Through flexible threads assignment and efficient scheduling strategy, the GPU-based BCH decoders are implemented significantly. Specially, we design and carry out BCH(n=504, k=432, t=7) decoders using the proposed approach. Besides, we evaluate its performance with respect to its CPU-based single-threaded counterpart developed in the C++ language. The experimental results show that our proposed GPU-based BCH decoders achieve a significant speedup of more than 50 times improvement. Furthermore, the implementation of the proposed BCH decoder architecture is scalable to various block lengths and various correctable error numbers, thus providing an efficient and convenient approach to do parallel BCH decoding processes on GPU.
Keywords :
BCH codes; C++ language; error correction codes; graphics processing units; iterative decoding; scheduling; Bose-Chaudhuri-Hocquenghen decoding; C++ language; CUDA; GPU; HINOC standard; accelerated BCH decoders; access network technology; compute unified device architecture; error correcting code; graphic process unit; high performance network over coax; iterative decoding; parallel BCH decoding; parallel decoding algorithm; parallel software implementation; single-threaded counterpart; BCH Decoder; CUDA; GPU; Parallel Computing; Thread;
Conference_Titel :
Wireless Communications & Signal Processing (WCSP), 2013 International Conference on
Conference_Location :
Hangzhou
DOI :
10.1109/WCSP.2013.6677084