• DocumentCode
    2369473
  • Title

    Parallel LDPC decoder implementation on GPU based on unbalanced memory coalescing

  • Author

    Kang, Soonyoung ; Moon, Jaekyun

  • Author_Institution
    Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    3692
  • Lastpage
    3697
  • Abstract
    We consider flexible decoder implementation of low density parity check (LDPC) codes via compute-unified-device-architecture (CUDA) programming on graphics processing unit (GPU), a research subject of considerable recent interest. An important issue in LDPC decoder design based on CUDA-GPU is realizing coalesced memory access, a technique that reduces memory transaction time considerably. In previous works along this direction, it has not been possible to achieve coalesced memory access in both the read and write operations due to the asymmetric nature of the bipartite graph describing the LDPC code structure. In this paper, a new algorithm is proposed that enables coalesced memory access in both the read and write operations for one half of the decoding process - either the bit-to-check or the check-to-bit message passing. For the remaining half of the decoding step our scheme requires address transformation in both the read and write operations but one translating array is sufficient. We also describe the use of on-chip shared memory and texture cache. Overall, experimental results show that proposed GPU-based LDPC decoder achieves more than 234×-speedup compared to CPU-based LDPC decoders and also outperforms existing GPU-based decoders by a significant margin.
  • Keywords
    decoding; graph theory; graphics processing units; message passing; parallel architectures; parity check codes; CUDA-GPU decoder; LDPC code structure; bipartite graph; bit-to-check message passing; check-to-bit message passing; coalesced memory access; compute-unified-device-architecture programming; decoding process; graphics processing unit; low density parity check codes; memory transaction time reduction; on-chip shared memory; parallel LDPC decoder design; texture cache; unbalanced memory coalescing; Arrays; Decoding; Error analysis; Graphics processing units; Memory management; Message passing; Parity check codes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (ICC), 2012 IEEE International Conference on
  • Conference_Location
    Ottawa, ON
  • ISSN
    1550-3607
  • Print_ISBN
    978-1-4577-2052-9
  • Electronic_ISBN
    1550-3607
  • Type

    conf

  • DOI
    10.1109/ICC.2012.6363991
  • Filename
    6363991