Title :
The Large Scale Data Facility: Data Intensive Computing for Scientific Experiments
Author :
García, Ariel O. ; Bourov, Serguei ; Hammad, Ahmad ; Van Wezel, Jos ; Neumair, Bernhard ; Streit, Achim ; Hartmann, Volker ; Jejkal, Thomas ; Neuberger, Patrick ; Stotzka, Rainer
Author_Institution :
Steinbuch Centre for Computing, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Abstract :
The Large Scale Data Facility (LSDF) at the Karlsruhe Institute of Technology was started at the end of 2009 with the aim of supporting the growing requirements of data-intensive experiments. In close cooperation with the scientific communities involved, the LSDF provides them not only with adequate storage space but also with a directly attached analysis farm and, more importantly, with value-added services for their big scientific data sets. Analysis workflows are supported through mixed Hadoop and OpenNebula cloud environments that are directly attached to the storage and enable efficient processing of the experimental data. Metadata handling is a central part of the LSDF: a metadata repository, community-specific metadata schemes, graphical tools, and APIs were developed for accessing and efficiently organizing the stored data sets.
Keywords :
cloud computing; computer graphics; data handling; meta data; storage management; API; Karlsruhe Institute of Technology; OpenNebula cloud environment; adequate data storage space; community specific metadata scheme; data intensive computing; graphical tools; large scale data facility; metadata handling; metadata repository; mixed Hadoop cloud environment; scientific communities; scientific data set; value added service; Communities; Data visualization; Memory; Microscopy; Protocols; Software; Throughput;
Conference_Title :
Parallel and Distributed Processing Workshops and PhD Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-425-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2011.286