• DocumentCode
    2192816
  • Title

    A conceptual framework for composing and managing scientific data lineage

  • Author

    Bose, Rajendra

  • Author_Institution
    Donald Bren Sch. of Environ. Sci. & Manage., California Univ., Santa Barbara, CA, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    15
  • Lastpage
    19
  • Abstract
    Scientific research relies as much on the dissemination and exchange of data sets as on the publication of conclusions. Accurately tracking the lineage (origin and subsequent processing history) of scientific data sets is thus imperative for the complete documentation of scientific work. However, the lack of a definitive data model for lineage, and the poor fit between current data management tools and scientific software, effectively prevent researchers front determining, preserving, or providing the lineage of the data products they use and create. Based on a comprehensive review of lineage-related research and previous prototype systems, a conceptual framework is presented to help identify and assess basic lineage system components. Within this framework, a direction is outlined for future work on general methods for composing and managing lineage for scientific data.
  • Keywords
    electronic data interchange; natural sciences computing; conceptual framework; data management; data set dissemination; data set exchange; documentation; processing history; scientific data lineage tracking; scientific research; scientific software; Assembly; Data models; Documentation; Environmental management; History; Pipelines; Prototypes; Software prototyping; Software tools; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2002. Proceedings. 14th International Conference on
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-1632-7
  • Type

    conf

  • DOI
    10.1109/SSDM.2002.1029701
  • Filename
    1029701