• DocumentCode
    3321606
  • Title

    Requirements of phylogenetic databases

  • Author

    Nakhleh, Luay ; Miranker, D.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas, Austin, TX, USA
  • fYear
    2003
  • fDate
    10-12 March 2003
  • Firstpage
    141
  • Lastpage
    148
  • Abstract
    We examine the organizational impact on phylogenetic databases of the increasing sophistication in the need and use of phylogenetic data. A primary issue is the use of the unnormalized representation of phylogenies in Newick format as a primitive data type in existing phylogenetic databases. In particular we identify and enumerate a list of potential applications of such databases and queries (use-cases) that biologists may wish to see integrated into a phylogenetic database management system. We show there are many queries that would best be supported by a normalized data model where phylogenies are stored as lists of edges. Since many of the queries require transitive traversals of the phylogenies we demonstrate, constructively, that complex phylogenetic queries can be conveniently constructed as Datalog programs. We address concerns with respect to the cost and performance of the normalized representation by developing and empirically evaluating a feasibility prototype.
  • Keywords
    biology computing; database management systems; genetics; Datalog programs; Newick format; edges list; evolutionary histories; feasibility prototype evaluation; normalized data model; normalized representation; organisms groups; phylogenetic database management system; phylogenetic databases requirements; primitive data type; transitive traversals; unnormalized representation; Bioinformatics; Biology computing; Computational biology; Data models; Database systems; Environmental factors; Evolution (biology); History; Phylogeny; Relational databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
  • Print_ISBN
    0-7695-1907-5
  • Type

    conf

  • DOI
    10.1109/BIBE.2003.1188940
  • Filename
    1188940