Title :
A Machine Learning Approach to Resolving Incongruence in Molecular Phylogenies and Visualization Analysis
Author_Institution :
Department of Mathematics and Bioinformatics Program Eastern Michigan University Ypsilanti, MI 48197 USA
Abstract :
The incongruence between gene trees and species trees is one of the most pervasive challenges in molecular phylogenetics. In this work, a machine learning approach is proposed to overcome this problem. In the machine learning approach, the gene data set is clustered by a self-organizing map (SOM). Then a phylogenetically informative core gene set is created by combining the maximum entropy gene from each cluster to conduct phylogenetic analysis. Using the same data set, this approach performs better than the previous random gene concatenation method. The SOM based information visualization is also employed to compare the species patterns in the phylogenetic tree constructions.
Keywords :
Gene trees; clustering analysis; entropy; information visualization; self-organizing map; species trees; Bioinformatics; Data visualization; Entropy; Genomics; History; Machine learning; Mathematical model; Phylogeny; Sampling methods; Sequences; Gene trees; clustering analysis; entropy; information visualization; self-organizing map; species trees;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
Print_ISBN :
0-7803-9387-2
DOI :
10.1109/CIBCB.2005.1594939