Title :
Comparison of Tree-Child Phylogenetic Networks
Author :
Cardona, Gabriel ; Rosselló, Francesc ; Valiente, Gabriel
Author_Institution :
Dept. of Math. & Comput. Sci., Univ. of the Balearic Islands, Palma de Mallorca, Spain
Abstract :
Phylogenetic networks are a generalization of phylogenetic trees that allow for the representation of non-tree-like evolutionary events, like recombination, hybridization, or lateral gene transfer. While much progress has been made to find practical algorithms for reconstructing a phylogenetic network from a set of sequences, all attempts to endorse a class of phylogenetic networks (strictly extending the class of phylogenetic trees) with a well-founded distance measure have, to the best of our knowledge and with the only exception of the bipartition distance on regular networks, failed so far. In this paper, we present and study a new meaningful class of phylogenetic networks, called tree-child phylogenetic networks, and we provide an injective representation of these networks as multisets of vectors of natural numbers, their path multiplicity vectors. We then use this representation to define a distance on this class that extends the well-known Robinson-Foulds distance for phylogenetic trees and to give an alignment method for pairs of networks in this class. Simple polynomial algorithms for reconstructing a tree-child phylogenetic network from its path multiplicity vectors, for computing the distance between two tree-child phylogenetic networks and for aligning a pair of tree-child phylogenetic networks, are provided. They have been implemented as a Perl package and a Java applet, which can be found at http://bioinfo.uib.es/~recerca/phylonetworks/mudistance/.
Keywords :
biocomputing; computational complexity; genetics; network theory (graphs); trees (mathematics); vectors; Java applet; Perl package; Robinson-Foulds distance; bipartition distance; lateral gene transfer; natural numbers; nontree-like evolutionary events; path multiplicity vectors; phylogenetic trees; polynomial algorithms; tree-child phylogenetic networks; Biology and genetics; Computer Applications; Discrete Mathematics; Graph Theory; Graph algorithms; Life and Medical Sciences; Mathematics of Computing; Phylogenetic network; Trees; network alignment.; partition distance; phylogenetic tree; tree-child phylogenetic network; Algorithms; Computational Biology; Computers; Data Interpretation, Statistical; Humans; Internet; Models, Genetic; Models, Statistical; Models, Theoretical; Phylogeny; Programming Languages; Sequence Alignment; Software;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2007.70270