• DocumentCode
    2453644
  • Title

    A Chunking Method for Euclidean Distance Matrix Calculation on Large Dataset Using Multi-GPU

  • Author

    Li, Qi ; Kecman, Vojislav ; Salman, Raied

  • Author_Institution
    Dept. of Comput. Sci. Sch. of Eng., Virginia Commonwealth Univ., Richmond, VA, USA
  • fYear
    2010
  • fDate
    12-14 Dec. 2010
  • Firstpage
    208
  • Lastpage
    213
  • Abstract
    Calculating Euclidean distance matrix is a data intensive operation and becomes computationally prohibitive for large datasets. Recent development of Graphics Processing Units (GPUs) has produced superb performance on scientific computing problems using massive parallel processing cores. However, due to the limited size of device memory, many GPU based algorithms have low capability in solving problems with large datasets. In this paper, a chunking method is proposed to calculate Euclidean distance matrix on large datasets. This is not only designed for scalability in multi-GPU environment but also to maximize the computational capability of each individual GPU device. We first implement a fast GPU algorithm that is suitable for calculating sub matrices of Euclidean distance matrix. Then we utilize a Map-Reduce like framework to split the final distance matrix calculation into many small independent jobs of calculating partial distance matrices, which can be efficiently solved by our GPU algorithm. The framework also dynamically allocates GPU resources to those independent jobs for maximum performance. The experimental results have shown a speed up of 15x on datasets which contain more than half million data points.
  • Keywords
    coprocessors; matrix algebra; parallel processing; Euclidean distance matrix; chunking method; graphics processing units; map-reduce; massive parallel processing cores; multi-GPU environment; Euclidean distance; Graphics processing unit; Instruction sets; Kernel; Matrix decomposition; Programming; Symmetric matrices; Chunking; Euclidean Distance Matrix; Multi-GPU;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-9211-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2010.38
  • Filename
    5708835