Title :
The design and implementation of a database for human genome research
Author :
Sargent, Rob ; Fuhrman, Dave ; Critchlow, Terence ; Sera, Tony Di ; Mecklenburg, Robert ; Lindstrom, Gary ; Cartwright, Peter
Author_Institution :
Utah Center for Human Genome Res., Salt Lake City, UT, USA
Abstract :
The Human Genome Project poses severe challenges in database design and implementation. These include comprehensive coverage of diverse data domains and user constituencies; robustness in the presence of incomplete, inconsistent and multi-version data; accessibility through many levels of abstraction, and scalability in content and organizational complexity. The paper presents a new data model developed to meet these challenges by the Utah Center for Human Genome Research. The central characteristics are: (i) a high level data model comprising five broadly applicable workflow notions; (ii) representation of those notions as objects in an extended relational model; (iii) expression of working database schemas as meta data in administration tables; (iv) population of the database through tables dependent on the meta data tables; and (v) implementation via a conventional relational database management system. The authors explore two advantages of this approach: the resulting representational flexibility, and the reflective use of meta data to accomplish schema evolution by ordinary updates. Implementation and performance pragmatics of this work are sketched, as well as implications for future database development
Keywords :
biology computing; cellular biophysics; data structures; genetics; molecular biophysics; object-oriented databases; relational databases; scientific information systems; Human Genome Project; Utah Center for Human Genome Research; abstraction; administration tables; content scalability; data model; database design; database implementation; database population; diverse data domains; diverse user constituencies; extended relational model; high level data model; incomplete data; inconsistent data; meta data; meta data tables; multi-version data; organizational complexity scalability; relational database management system; workflow notions; working database schemas; Bioinformatics; Cities and towns; Computer science; Data models; Genomics; Humans; Informatics; Laboratories; Protocols; Relational databases;
Conference_Titel :
Scientific and Statistical Database Systems, 1996. Proceedings., Eighth International Conference on
Conference_Location :
Stockholm
Print_ISBN :
0-8186-7264-1
DOI :
10.1109/SSDM.1996.506064