• DocumentCode
    1831241
  • Title
    Long-term digital archiving based on selection of repositories over P2P networks
  • Author
    Vignatti, Tiago ; Bona, Luis C E ; Sunye, Marcos S. ; Vignatti, André L.
  • Author_Institution
    Dept. of Inf., Fed. Univ. of Parana, Brazil
  • fYear
    2009
  • fDate
    9-11 Sept. 2009
  • Firstpage
    194
  • Lastpage
    203
  • Abstract
    The importance of digital information has been increasing steadily in recent years. Such information often needs to be preserved for the long term, which is the responsibility of digital archiving systems. This paper proposes a reliable replication model for immutable digital content to be used in long-term archiving systems. The archiving system is modeled as a set of storage repositories, each with an independent failure probability assigned to it. Items are inserted with a required reliability, which is satisfied by replicating them in subsets of repositories. Through simulation, we evaluate three proposed strategies for creating replicas. We also propose a completely distributed archiving system that uses this model over a structured peer-to-peer (P2P) network. Communication between the nodes (repositories) of the network is organized in a distributed hash table, and multiple hash functions are used to select the repositories that will keep the replicas of each stored item. The system is evaluated through experiments in a real environment. The proposed model and algorithms, combined with the scalability of structured P2P networks, made possible the construction of a reliable and fully distributed digital archiving system.
  • Keywords
    digital libraries; file organisation; information retrieval systems; peer-to-peer computing; probability; distributed archiving system; distributed hash table; immutable digital content; independent fail probability; long-term digital archiving system; multiple hash function; reliable replication model; storage repository selection; structured peer-to-peer network scalability; Computer networks; Electronic mail; Environmental economics; Hardware; Informatics; Peer to peer computing; Scalability; Software libraries; Telecommunication network reliability; Web and internet services;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Peer-to-Peer Computing, 2009. P2P '09. IEEE Ninth International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    978-1-4244-5066-4
  • Electronic_ISBN
    978-1-4244-5067-1
  • Type
    conf
  • DOI
    10.1109/P2P.2009.5284519
  • Filename
    5284519
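
The replication idea summarized in the abstract (repositories with independent failure probabilities, an item-level reliability target, and multiple hash functions that pick where replicas go) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the three strategies that were evaluated, so the greedy stopping rule, the function name `select_repositories`, the salted SHA-256 scheme, and the `max_hashes` cap are all assumptions made for illustration. The only modeling fact taken from the record is that failures are independent, so an item is lost only if every replica fails.

```python
import hashlib


def select_repositories(key, fail_probs, target_reliability, max_hashes=1000):
    """Hypothetical helper: choose replica holders for `key`.

    fail_probs[i] is the independent failure probability of repository i.
    Repositories are picked by a family of salted hashes (an assumption,
    standing in for the "multiple hash functions" of the abstract) until
    1 - prod(fail_probs of chosen) >= target_reliability.
    """
    if not fail_probs:
        raise ValueError("no repositories available")

    chosen = []
    loss_prob = 1.0  # probability that every chosen replica fails

    for k in range(max_hashes):
        if 1.0 - loss_prob >= target_reliability:
            return chosen, 1.0 - loss_prob
        if len(chosen) == len(fail_probs):
            break  # every repository already holds a replica

        # k-th hash function: SHA-256 over the key salted with k
        digest = hashlib.sha256(f"{key}:{k}".encode("utf-8")).digest()
        idx = int.from_bytes(digest[:8], "big") % len(fail_probs)
        if idx in chosen:
            continue  # collision with an earlier pick, try the next hash

        chosen.append(idx)
        loss_prob *= fail_probs[idx]

    if 1.0 - loss_prob >= target_reliability:
        return chosen, 1.0 - loss_prob
    raise ValueError("target reliability not reachable with these repositories")


if __name__ == "__main__":
    repos, reliability = select_repositories(
        "doc-1831241", [0.1, 0.2, 0.05, 0.3], 0.99
    )
    print(repos, reliability)
```

Because the selection depends only on the key and the list of failure probabilities, any node that knows both can recompute the same replica set without coordination, which is the property that makes hash-based placement a natural fit for a distributed hash table.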