Title :
A reconfigurable GPU implementation for Tomlinson-Harashima precoding
Author :
Domene, Fernando ; Roger, Sandra ; Ramiro, Carla ; Piñero, Gema ; Gonzalez, Alberto
Author_Institution :
Inst. of Telecommun. & Multimedia Applic., Univ. Politec. de Valencia, Valencia, Spain
Abstract :
Fast parallel processing capability of general purpose Graphic Processing Units (GPU) can be exploited to accelerate the precoding calculation needed in spatially multiplexed wireless communication systems. In this paper, a GPU-based implementation of the well-known multiuser Tomlinson-Harashima precoding (THP) scheme combined with a lattice-reduction (LR) stage is presented. The proposed approach allows the LR stage to be switched off when user requirements are achieved by using only THP. Moreover, our GPU implementation provides scalability in the number of sub-carriers per symbol, which is a key factor in LTE and 4G wireless standards. Simulation results show that the GPU-based THP implementation performs up to 7 times faster than its CPU-equivalent whereas the LR stage implementation only achieves a speedup of 3. Despite the fact that the LR cannot be as efficiently parallelized as the THP, a speedup of nearly 6 is achieved when both are combined.
Keywords :
4G mobile communication; Long Term Evolution; graphics processing units; multi-access systems; parallel processing; precoding; 4G wireless standards; CPU; GPU; LTE; graphics processing units; lattice-reduction stage; multiuser Tomlinson-Harashima precoding scheme; parallel processing; precoding calculation; spatially multiplexed wireless communication systems; Graphics processing unit; Instruction sets; Kernel; MIMO; OFDM; Signal processing algorithms; Switches; GPU; Multiuser precoding; Tomlinson-Harashima Precoding;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288207