DocumentCode
653515
Title
The Dynamically Efficient Mechanism of HDFS Data Prefetching
Author
Shaochun Wu ; Guobing Zou ; Honghao Zhu ; Xiang Shuai ; Liang Chen ; Bofeng Zhang
Author_Institution
Sch. of Comput. Eng. & Sci., Shanghai Univ., Shanghai, China
fYear
2013
fDate
20-23 Aug. 2013
Firstpage
2188
Lastpage
2193
Abstract
In recent years, along with cloud computing developing as a widely used computing paradigm, Hadoop Distributed File System (HDFS) has become one of the mandatory techniques, which has many important features, such as master and slave construction of HDFS, direct client accessing, and multi-duplicate of each data block. All of these make HDFS data prefetching much harder than the traditional data acquisition approaches. Moreover, the basic problems of HDFS data prefetching mainly include what kind of data to prefetch, where to prefetch data, how many data to prefetch, and the balance of prefetching data services and normal data access conflicts. Under above analysis, this paper tries to solve these problems and propose the mechanism of the two-layer HDFS data prefetching. The experimental results show that the Hadoop platform which offers data prefetching mechanism can improve 60% of whole performance on data prefetching.
Keywords
distributed databases; storage management; HDFS data prefetching; HDFS master-slave construction; Hadoop distributed file system; computing paradigm; data access conflicts; data acquisition approaches; data block multiduplication; direct client accessing; Collaboration; Delays; Distributed databases; Internet; Prefetching; Servers; Text analysis; Cloud computing; Data block; Data prefetching; HDFS; Metadata;
fLanguage
English
Publisher
ieee
Conference_Titel
Green Computing and Communications (GreenCom), 2013 IEEE and Internet of Things (iThings/CPSCom), IEEE International Conference on and IEEE Cyber, Physical and Social Computing
Conference_Location
Beijing
Type
conf
DOI
10.1109/GreenCom-iThings-CPSCom.2013.413
Filename
6682423
Link To Document