DocumentCode
2858652
Title
Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments
Author
JaJa, Joesph ; Smorul, Mike ; McCall, Fritz ; Wang, Yang
Author_Institution
Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA
fYear
2005
fDate
11-14 April 2005
Firstpage
197
Lastpage
201
Abstract
Emerging technologies in high speed NAS, hierarchical storage management systems, and networked systems that virtualize interconnected storage over IP and fiber-channel networks, promise to consolidate distributed data stores onto large-scale professionally managed enterprise storage environments. We describe the software architecture of the PAWN (Producer - Archive Workflow Network) environment that enables scalable, reliable marshalling and organization of distributed data into such enterprise storage environments. PAWN was initially developed to capture the core elements required for long term preservation of digital objects as identified by researchers in the digital library and archiving communities. In this paper, we show how PAWN can be extended to enable multiple clients at a number of distributed sites to prepare, organize, and bulk transfer large scale data onto clusters of servers that securely verify the integrity of the data, register the metadata, and store the data into an enterprise storage environment. PAWN allows detailed description, auditing, and organization of the data, and hence will allow for efficient management, access, and disaster recovery. The basic software components are based on open standards and web technologies, and hence are platform independent.
Keywords
business continuity; data integrity; digital libraries; distributed databases; meta data; software architecture; storage management; workstation clusters; data integrity; digital library; disaster recovery; distributed large scale data organization; enterprise storage environment; hierarchical storage management system; metadata register; producer-archive workflow network; software architecture; Collaboration; Computer architecture; Computer network management; Computer networks; Distributed computing; Educational institutions; Environmental management; Large-scale systems; Software architecture; Technology management;
fLanguage
English
Publisher
ieee
Conference_Titel
Mass Storage Systems and Technologies, 2005. Proceedings. 22nd IEEE / 13th NASA Goddard Conference on
Print_ISBN
0-7695-2318-8
Type
conf
DOI
10.1109/MSST.2005.29
Filename
1410736
Link To Document