DocumentCode
1903509
Title
The design and implementation of a database for human genome research
Author
Sargent, Rob ; Fuhrman, Dave ; Critchlow, Terence ; Sera, Tony Di ; Mecklenburg, Robert ; Lindstrom, Gary ; Cartwright, Peter
Author_Institution
Utah Center for Human Genome Res., Salt Lake City, UT, USA
fYear
1996
fDate
18-20 Jun 1996
Firstpage
220
Lastpage
225
Abstract
The Human Genome Project poses severe challenges in database design and implementation. These include comprehensive coverage of diverse data domains and user constituencies; robustness in the presence of incomplete, inconsistent and multi-version data; accessibility through many levels of abstraction, and scalability in content and organizational complexity. The paper presents a new data model developed to meet these challenges by the Utah Center for Human Genome Research. The central characteristics are: (i) a high level data model comprising five broadly applicable workflow notions; (ii) representation of those notions as objects in an extended relational model; (iii) expression of working database schemas as meta data in administration tables; (iv) population of the database through tables dependent on the meta data tables; and (v) implementation via a conventional relational database management system. The authors explore two advantages of this approach: the resulting representational flexibility, and the reflective use of meta data to accomplish schema evolution by ordinary updates. Implementation and performance pragmatics of this work are sketched, as well as implications for future database development
Keywords
biology computing; cellular biophysics; data structures; genetics; molecular biophysics; object-oriented databases; relational databases; scientific information systems; Human Genome Project; Utah Center for Human Genome Research; abstraction; administration tables; content scalability; data model; database design; database implementation; database population; diverse data domains; diverse user constituencies; extended relational model; high level data model; incomplete data; inconsistent data; meta data; meta data tables; multi-version data; organizational complexity scalability; relational database management system; workflow notions; working database schemas; Bioinformatics; Cities and towns; Computer science; Data models; Genomics; Humans; Informatics; Laboratories; Protocols; Relational databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Scientific and Statistical Database Systems, 1996. Proceedings., Eighth International Conference on
Conference_Location
Stockholm
Print_ISBN
0-8186-7264-1
Type
conf
DOI
10.1109/SSDM.1996.506064
Filename
506064
Link To Document