• DocumentCode
    1903509
  • Title

    The design and implementation of a database for human genome research

  • Author

    Sargent, Rob ; Fuhrman, Dave ; Critchlow, Terence ; Sera, Tony Di ; Mecklenburg, Robert ; Lindstrom, Gary ; Cartwright, Peter

  • Author_Institution
    Utah Center for Human Genome Res., Salt Lake City, UT, USA
  • fYear
    1996
  • fDate
    18-20 Jun 1996
  • Firstpage
    220
  • Lastpage
    225
  • Abstract
    The Human Genome Project poses severe challenges in database design and implementation. These include comprehensive coverage of diverse data domains and user constituencies; robustness in the presence of incomplete, inconsistent and multi-version data; accessibility through many levels of abstraction, and scalability in content and organizational complexity. The paper presents a new data model developed to meet these challenges by the Utah Center for Human Genome Research. The central characteristics are: (i) a high level data model comprising five broadly applicable workflow notions; (ii) representation of those notions as objects in an extended relational model; (iii) expression of working database schemas as meta data in administration tables; (iv) population of the database through tables dependent on the meta data tables; and (v) implementation via a conventional relational database management system. The authors explore two advantages of this approach: the resulting representational flexibility, and the reflective use of meta data to accomplish schema evolution by ordinary updates. Implementation and performance pragmatics of this work are sketched, as well as implications for future database development
  • Keywords
    biology computing; cellular biophysics; data structures; genetics; molecular biophysics; object-oriented databases; relational databases; scientific information systems; Human Genome Project; Utah Center for Human Genome Research; abstraction; administration tables; content scalability; data model; database design; database implementation; database population; diverse data domains; diverse user constituencies; extended relational model; high level data model; incomplete data; inconsistent data; meta data; meta data tables; multi-version data; organizational complexity scalability; relational database management system; workflow notions; working database schemas; Bioinformatics; Cities and towns; Computer science; Data models; Genomics; Humans; Informatics; Laboratories; Protocols; Relational databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Systems, 1996. Proceedings., Eighth International Conference on
  • Conference_Location
    Stockholm
  • Print_ISBN
    0-8186-7264-1
  • Type

    conf

  • DOI
    10.1109/SSDM.1996.506064
  • Filename
    506064