DocumentCode :
187074
Title :
A Distributed NameNode Cluster for a Highly-Available Hadoop Distributed File System
Author :
Yonghwan Kim ; Araragi, Tadashi ; Nakamura, Jun ; Masuzawa, Toru
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
fYear :
2014
fDate :
6-9 Oct. 2014
Firstpage :
333
Lastpage :
334
Abstract :
Hadoop has recently attracted much attention from engineers and researchers as an emerging and effective framework for Big Data. HDFS (Hadoop Distributed File System) can manage huge amounts of data with high performance and reliability using only commodity hardware. However, HDFS requires a single master node, called a NameNode, to manage the entire namespace of the file system. This creates a SPOF (Single Point Of Failure) problem because the file system becomes inaccessible when the NameNode fails. It also creates a performance bottleneck, since every access request to the file system has to contact the NameNode. Finally, scaling up the namespace is difficult because the NameNode keeps all metadata of the namespace in its own memory, which is a limited and expensive resource. In this paper, we propose a new HDFS architecture consisting of several NameNodes to resolve all of the above problems.
Keywords :
Big Data; distributed databases; network operating systems; parallel processing; reliability; software architecture; HDFS architecture; Hadoop distributed file system; SPOF; commodity hardware; distributed NameNode cluster; single point of failure; Distributed databases; Educational institutions; File systems; Hardware; Load management; Silicon; Synchronization; Distributed NameNodes; HDFS; Hadoop; High-Availability
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Reliable Distributed Systems (SRDS), 2014 IEEE 33rd International Symposium on
Conference_Location :
Nara
Type :
conf
DOI :
10.1109/SRDS.2014.61
Filename :
6983409