• DocumentCode
    598576
  • Title

    Usage behavior of a large-scale scientific archive

  • Author

    Adams, Ian F. ; Madden, Brian A. ; Frank, Joel C. ; Storer, Mark W. ; Miller, Eric L. ; Harano, G.

  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    Archival storage systems for scientific data have been growing in both size and relevance over the past two decades, yet researchers and system designers alike must rely on limited and obsolete knowledge to guide archival management and design. To address this issue, we analyzed three years of filelevel activities from the NCAR mass storage system, providing valuable insight into a large-scale scientific archive with over 1600 users, tens of millions of files, and petabytes of data. Our examination of system usage showed that, while a subset of users were responsible for most of the activity, this activity was widely distributed at the file level. We also show that the physical grouping of files and directories on media can improve archival storage system performance. Based on our observations, we provide suggestions and guidance for both future scientific archival system designs as well as improved tracing of archival activity.
  • Keywords
    information retrieval systems; storage management; NCAR mass storage system; archival management; archival storage systems; large-scale scientific archive desihn; usage behavior; Aggregates; Data models; Drives; Electric breakdown; Hardware; Libraries; Media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    2167-4329
  • Print_ISBN
    978-1-4673-0805-2
  • Type

    conf

  • DOI
    10.1109/SC.2012.110
  • Filename
    6468456