DocumentCode :
3290956
Title :
Evolutionary Trends in a Supercomputing Tertiary Storage Environment
Author :
Frank, Joel C. ; Miller, Ethan L. ; Adams, Ian F. ; Rosenthal, Daniel C.
Author_Institution :
Storage Syst. Res. Center, Univ. of California, Santa Cruz, CA, USA
fYear :
2012
fDate :
7-9 Aug. 2012
Firstpage :
411
Lastpage :
419
Abstract :
Tracking archival usage and data migration in a long term supercomputing system is critical to understanding not only how users´ needs and habits have changed over time, but also how the archive itself evolves in response to these external factors. Yet this type of study has not previously been performed. To address this need, we conducted an in-depth comparison of user initiated file activity on the mass storage system (MSS) at the National Center for Atmospheric Research (NCAR) during two periods, one in the early 1990s, and another nearly twenty years later. In addition to confirming earlier findings, our analysis turned up three surprising results. First, the read: write ratio went from 2:1 in the earlier trace to 1:2 in the later trace, a reduction of a factor of four in reads relative to writes. Second, only 30% of the current archive was accessed during the three year period of the study, in stark contrast to the 80% seen in the 1992 trace analysis. Third, access latency to the first byte of data actually got slower despite much faster computers and storage devices. These findings indicate that archival behavior has shifted towards a write-heavy workload, and that future archives can be more optimized for write activity than previously believed. Furthermore it may be worth considering the value of data being archived when it is stored, since later retrieval is increasingly less likely.
Keywords :
evolutionary computation; information retrieval systems; parallel machines; storage management; MSS; archival usage tracking; data migration; data retrieval; data storage device; evolutionary computation; mass storage system; optimization; read-write ratio; supercomputing tertiary storage; user initiated file activity; write activity; Bandwidth; Computers; Data models; Hardware; Market research; Meteorology; Organizations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2012 IEEE 20th International Symposium on
Conference_Location :
Washington, DC
ISSN :
1526-7539
Print_ISBN :
978-1-4673-2453-3
Type :
conf
DOI :
10.1109/MASCOTS.2012.53
Filename :
6298201
Link To Document :
بازگشت