• DocumentCode
    2321899
  • Title

    The GPU Enhanced Parallel Computing for Large Scale Data Clustering

  • Author

    Cui, Xiaohui ; Charles, Jesse St ; Potok, Thomas E.

  • Author_Institution
    Oak Ridge Nat. Lab., Oak Ridge, TN, USA
  • fYear
    2011
  • fDate
    10-12 Oct. 2011
  • Firstpage
    220
  • Lastpage
    225
  • Abstract
    Analyzing and clustering large scale data set is a complex problem. One explored method of solving this problem borrows from nature, imitating the flocking behavior of birds. One limitation of this method of data clustering is its complexity O(n2). As the number of data and feature dimensions grows, it becomes increasingly difficult to generate results in a reasonable amount of time. In the last few years, the graphics processing unit (GPU) has received attention for its ability to solve highly-parallel and semi-parallel problems much faster than the traditional sequential processor. In this chapter, we have conducted research to exploit this architecture and apply its strengths to the flocking based data clustering problem. Using the CUDA platform from NVIDIA, we developed a Multiple Species Data Flocking implementation to be run on the NVIDIA GPU. Performance gains ranged from 30 to 60 times improvement of the GPU over the CPU implementation.
  • Keywords
    computational complexity; computer graphic equipment; coprocessors; data handling; parallel processing; pattern clustering; CUDA platform; GPU enhanced parallel computing; NVIDIA GPU; flocking based data clustering problem; graphics processing unit; large scale data analysis; large scale data clustering; multiple species data flocking; sequential processor; Birds; Clustering algorithms; Graphics processing unit; Instruction sets; Kernel; Runtime; Vectors; GPU; clustering; flocking; large scale;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2011 International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4577-1827-4
  • Type

    conf

  • DOI
    10.1109/CyberC.2011.44
  • Filename
    6079384