DocumentCode :
640615
Title :
A CUDA-enabled Hadoop cluster for fast distributed image processing
Author :
Malakar, Ranajoy ; Vydyanathan, Naga
Author_Institution :
Corp. Res. & Technol., Siemens Technol. Services, Bangalore, India
fYear :
2013
fDate :
21-23 Feb. 2013
Firstpage :
1
Lastpage :
5
Abstract :
Hadoop is a map-reduce based distributed processing framework, frequently used in the industry today, in areas of big data analysis, particularly text analysis. Graphics processing units (GPUs), on the other hand, are massively parallel platforms with attractive performance to price and power ratios, used extensively in the recent years for acceleration of data parallel computations. CUDA or Compute Unified Device Architecture is a C-based programming model proposed by NVIDIA for leveraging the parallel computing capabilities of the GPU for general purpose computations. This paper attempts to integrate CUDA acceleration into the Hadoop distributed processing framework to create a heterogeneous high performance image processing system. As Hadoop primarily is used for text analysis, this involves facilitating efficient image processing in Hadoop. Our experimental evaluations using a Adaboost based face detection algorithm indicate that CUDA-enabling a Hadoop cluster, even with low-end GPUs, can result in a 25% improvement in data processing throughput, indicating that an integration of these two technologies can help build scalable, high throughput, power and cost-efficient computing platforms.
Keywords :
C language; data analysis; face recognition; graphics processing units; learning (artificial intelligence); parallel architectures; text analysis; Adaboost based face detection algorithm; C-based programming model; CUDA-enabled Hadoop cluster; Compute Unified Device Architecture; GPU; Hadoop distributed processing framework; Map-reduce based distributed processing framework; NVIDIA; big data analysis; data parallel computations; distributed image processing; general purpose computations; graphics processing units; heterogeneous high performance image processing system; parallel platforms; text analysis; Acceleration; Face detection; Graphics processing units; Image processing; Java; Streaming media; Throughput; CUDA; GPGPU; Hadoop; Map-reduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Computing Technologies (PARCOMPTECH), 2013 National Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-1589-7
Type :
conf
DOI :
10.1109/ParCompTech.2013.6621392
Filename :
6621392
Link To Document :
بازگشت