• DocumentCode
    2858652
  • Title

    Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments

  • Author

    JaJa, Joseph ; Smorul, Mike ; McCall, Fritz ; Wang, Yang

  • Author_Institution
    Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA
  • fYear
    2005
  • fDate
    11-14 April 2005
  • Firstpage
    197
  • Lastpage
    201
  • Abstract
    Emerging technologies in high-speed NAS, hierarchical storage management systems, and networked systems that virtualize interconnected storage over IP and fiber-channel networks promise to consolidate distributed data stores onto large-scale, professionally managed enterprise storage environments. We describe the software architecture of the PAWN (Producer - Archive Workflow Network) environment that enables scalable, reliable marshalling and organization of distributed data into such enterprise storage environments. PAWN was initially developed to capture the core elements required for long-term preservation of digital objects as identified by researchers in the digital library and archiving communities. In this paper, we show how PAWN can be extended to enable multiple clients at a number of distributed sites to prepare, organize, and bulk transfer large-scale data onto clusters of servers that securely verify the integrity of the data, register the metadata, and store the data into an enterprise storage environment. PAWN allows detailed description, auditing, and organization of the data, and hence will allow for efficient management, access, and disaster recovery. The basic software components are based on open standards and web technologies, and hence are platform independent.
  • Keywords
    business continuity; data integrity; digital libraries; distributed databases; meta data; software architecture; storage management; workstation clusters; digital library; disaster recovery; distributed large scale data organization; enterprise storage environment; hierarchical storage management system; metadata register; producer-archive workflow network; Collaboration; Computer architecture; Computer network management; Computer networks; Distributed computing; Educational institutions; Environmental management; Large-scale systems; Technology management
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Mass Storage Systems and Technologies, 2005. Proceedings. 22nd IEEE / 13th NASA Goddard Conference on
  • Print_ISBN
    0-7695-2318-8
  • Type

    conf

  • DOI
    10.1109/MSST.2005.29
  • Filename
    1410736