DocumentCode :
2860411
Title :
CHMasters: A Scalable and Speed-Efficient Metadata Service in Distributed File System
Author :
Xu, Min ; Zhou, Junrui ; Zhou, Wei ; An, Hong
Author_Institution :
Dept. of Comput. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2011
fDate :
20-22 Oct. 2011
Firstpage :
394
Lastpage :
399
Abstract :
Distributed file system (DFS) is playing important roles of supporting large distributed data-intensive applications to meet storage needs. Typically, the design of DFS, such as GFS in Google, DMS in Cisco and TFS in Alibaba, is driven by observations of specific application workloads, internal demands and technological environment. In such systems, the metadata service is a critical factor that can affect the file system performance and availability to a great degree. Five requirements have been summarized for the metadata service: location transparent file service, smart director, efficient speed, strong scalability and friendly collaborator. In this paper, we present metadata service module called CH Masters in our DFS. Consistent hashing protocol is used to relieve potential hot spots on name servers. Files\´ metadata and master nodes are mapped into the same hash space by consistent hash function. And then files\´ metadata are scattered to master nodes by clockwise "closest" principle. Chunk server acts as a client when report its chunks info. Only a small proportion of files\´ metadata will be rehashed when master nodes state change. A new scalable file mapping strategy is also proposed to map file sizes from few MB to several GB efficiently. After intensive experiments, it shows CH Masters is satisfying the above five requirements.
Keywords :
distributed processing; file organisation; meta data; Alibaba; CH Masters; CHMasters; Cisco; DMS; GFS; Google; TFS; chunk server; consistent hash function; consistent hashing protocol; distributed file system; efficient speed requirement; friendly collaborator requirement; location transparent file service requirement; metadata service module; smart director requirement; strong scalability requirement; Clocks; Computer architecture; File systems; Google; Protocols; Servers; Testing; DFS; consistent hashing; mapping strategy; metadata service;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2011 12th International Conference on
Conference_Location :
Gwangju
Print_ISBN :
978-1-4577-1807-6
Type :
conf
DOI :
10.1109/PDCAT.2011.26
Filename :
6118550
Link To Document :
بازگشت