• DocumentCode
    3240276
  • Title

    Applying Chimera Virtual Data Concepts to Cluster Finding in the Sloan Sky Survey

  • Author

    Annis, James ; Zhao, Yong ; Voeckler, Jens ; Wilde, Michael ; Kent, Steve ; Foster, Ian

  • Author_Institution
    Fermilab
  • fYear
    2002
  • fDate
    16-22 Nov. 2002
  • Firstpage
    56
  • Lastpage
    56
  • Abstract
    In many scientific disciplines — especially long running, data- intensive collaborations — it is important to track all aspects of data capture, production, transformation, and analysis. In principle, one can then audit, validate, reproduce, and/or re-run with corrections various data transformations. We have recently proposed and prototyped the Chimera virtual data system, a new database-driven approach to this problem. We present here a major application study in which we apply Chimera to a challenging data analysis problem: the identification of galaxy clusters within the Sloan Digital Sky Survey. We describe the problem, its computational procedures, and the use of Chimera to plan and orchestrate the workflow of thousands of tasks on a data grid comprising hundreds of computers. This experience suggests that a general set of tools can indeed enhance the accuracy and productivity of scientific data reduction and that further development and application of this paradigm will offer great value.
  • Keywords
    Astrophysics; Catalogs; Clustering algorithms; Collaboration; Computer science; Data systems; Distributed computing; Grid computing; Information retrieval; Mathematics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, ACM/IEEE 2002 Conference
  • ISSN
    1063-9535
  • Print_ISBN
    0-7695-1524-X
  • Type

    conf

  • DOI
    10.1109/SC.2002.10021
  • Filename
    1592892