• DocumentCode
    3006258
  • Title

    EDRFS: An Effective Distributed Replication File System for Small-File and Data-Intensive Application

  • Author

    Bin Cai ; Changsheng Xie ; Guangxi Zhu

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
  • fYear
    2007
  • fDate
    7-12 Jan. 2007
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    With the system scale keeping grown, the key challenge is to mask the failures that arise among the system components and to improve the performance of data-intensive applications. This paper designs and implements a cluster-based distributed replication file system EDRFS to meet these critical demands. EDRFS works with a single metadata server and multiple storage nodes, deploys whole-file replication scheme at the file level, and tracks what storage node a file is replicated on. We use a linear hash algorithm to evenly distribute data and load across multiple storage nodes so as to achieve balancing workload and incremental scalability of throughput and storage capacity as the system scale grows. In addition, we employ metadata caches and file data caches in clients to enhance system performance. Furthermore, we deploy a concurrency lock scheme to avoid namespace operation bottleneck and a replicas consistency method to keep a consistent mutation order among replicas of a file. We provide the initial experimental evaluations of our prototypical system on a small-file and data-intensive workload.
  • Keywords
    replicated databases; software reliability; storage area networks; EDRFS; cluster storage systems; data-intensive application; effective distributed replication file system; file data caches; metadata server; multiple storage nodes; namespace operation bottleneck; reliability; replicas consistency method; small-file application; Application software; Computer science; Concurrent computing; Costs; File servers; File systems; Genetic mutations; Laboratories; Scalability; Throughput; cluster storage systems; distributed systems; file systems; reliability; replication systemes; storage area network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Systems Software and Middleware, 2007. COMSWARE 2007. 2nd International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    1-4244-0613-7
  • Type

    conf

  • DOI
    10.1109/COMSWA.2007.382422
  • Filename
    4268065