DocumentCode :
2858652
Title :
Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments
Author :
JaJa, Joesph ; Smorul, Mike ; McCall, Fritz ; Wang, Yang
Author_Institution :
Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA
fYear :
2005
fDate :
11-14 April 2005
Firstpage :
197
Lastpage :
201
Abstract :
Emerging technologies in high speed NAS, hierarchical storage management systems, and networked systems that virtualize interconnected storage over IP and fiber-channel networks, promise to consolidate distributed data stores onto large-scale professionally managed enterprise storage environments. We describe the software architecture of the PAWN (Producer - Archive Workflow Network) environment that enables scalable, reliable marshalling and organization of distributed data into such enterprise storage environments. PAWN was initially developed to capture the core elements required for long term preservation of digital objects as identified by researchers in the digital library and archiving communities. In this paper, we show how PAWN can be extended to enable multiple clients at a number of distributed sites to prepare, organize, and bulk transfer large scale data onto clusters of servers that securely verify the integrity of the data, register the metadata, and store the data into an enterprise storage environment. PAWN allows detailed description, auditing, and organization of the data, and hence will allow for efficient management, access, and disaster recovery. The basic software components are based on open standards and web technologies, and hence are platform independent.
Keywords :
business continuity; data integrity; digital libraries; distributed databases; meta data; software architecture; storage management; workstation clusters; data integrity; digital library; disaster recovery; distributed large scale data organization; enterprise storage environment; hierarchical storage management system; metadata register; producer-archive workflow network; software architecture; Collaboration; Computer architecture; Computer network management; Computer networks; Distributed computing; Educational institutions; Environmental management; Large-scale systems; Software architecture; Technology management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mass Storage Systems and Technologies, 2005. Proceedings. 22nd IEEE / 13th NASA Goddard Conference on
Print_ISBN :
0-7695-2318-8
Type :
conf
DOI :
10.1109/MSST.2005.29
Filename :
1410736
Link To Document :
بازگشت