DocumentCode :
660638
Title :
Pipeline-Based Parallel Framework for Mass File Processing
Author :
Tao Liu ; Yi Liu ; Qingquan Wang ; Xiangrong Wang ; Fei Gao ; Depei Qian
Author_Institution :
Sino-German Joint Software Inst., Beihang Univ., Beijing, China
fYear :
2013
fDate :
4-6 Nov. 2013
Firstpage :
42
Lastpage :
48
Abstract :
Currently, there exists billions of files on the Internet, such as pictures, web pages, audio and video files, etc., and the number is still growing rapidly. These huge amount of files need to be processed by some applications as quickly as possible with parallel processing. With the increasing of cores in processors, parallel programming becomes more complex. The behavior that multiple parallel processes/threads access files simultaneously may interfere with each other and cause extra performance loss. Consequently, this paper proposes a pipeline-based parallel framework for mass file processing, in which file processing is divided into multiple stages to compose a pipeline. Files flow through these stages one by one, and the interferences in file-accessing are avoided. Moreover, the parallel programming can be simplified by means of parallel frameworks and programming interfaces. Experiments with one real-world application and some micro-benchmarks show that the framework can efficiently improve system performance.
Keywords :
file organisation; multi-threading; pipeline processing; Internet; Web pages; audio files; file-accessing; mass file processing; parallel processing; parallel programming; pictures; pipeline-based parallel framework; programming interfaces; system performance; threads access files; video files; Parallel programming; Pipeline processing; Pipelines; Prefetching; big data; parallel programming; pipeline framework;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud and Service Computing (CSC), 2013 International Conference on
Conference_Location :
Beijing
Type :
conf
DOI :
10.1109/CSC.2013.15
Filename :
6693177
Link To Document :
بازگشت