• DocumentCode
    2389554
  • Title

    Implementation of large-scale fir adaptive filters on NVIDIA GeForce graphics processing unit

  • Author

    Hirano, Akihiro ; Nakayama, Kenji

  • Author_Institution
    Kanazawa Univ., Kanazawa, Japan
  • fYear
    2010
  • fDate
    6-8 Dec. 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents implementations of an FIR adaptive filter with a large number of taps on nVIDIA GeForce graphics processing unit (GPU) and CUDA software development environment. In order to overcome a long access latency for slow off-chip memory access, reduction of memory accesses by re-ordering and vector load/store operations and an increase of the number of threads are introduced. A tree adder is introduced to reduce the cost for summing thread outputs up. A simultaneous execution of multiple filters are also examined. On low-cost platform such as an Atom/ION nettop, GPU will accelerates the computation by almost three times. For simultaneous multiple simulations such as an ensemble averaging, a GPU with a large number of processing elements outperforms a dual-core CPU; almost six times faster for 16 runs.
  • Keywords
    FIR filters; adaptive filters; computer graphic equipment; coprocessors; Atom nettop; CUDA software development; GPU; ION nettop; NVIDIA GeForce graphics processing unit; dual core CPU; large scale FIR adaptive filters; off-chip memory access; Filtering algorithms; Graphics processing unit; Linux; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Signal Processing and Communication Systems (ISPACS), 2010 International Symposium on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-7369-4
  • Type

    conf

  • DOI
    10.1109/ISPACS.2010.5704666
  • Filename
    5704666