Title :
FEAST: Sensitive Local Alignment with Multiple Rates of Evolution
Author :
Hudek, Alexander K. ; Brown, Daniel G.
Author_Institution :
Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
Abstract :
We present a pairwise local aligner, FEAST, which uses two new techniques: a sensitive extension algorithm for identifying homologous subsequences, and a descriptive probabilistic alignment model. We also present a new procedure for training alignment parameters and apply it to the human and mouse genomes, producing a better parameter set for these sequences. Our extension algorithm identifies homologous subsequences by considering all evolutionary histories. It has higher maximum sensitivity than Viterbi extensions, and better balances specificity. We model alignments with several submodels, each with unique statistical properties, describing strongly similar and weakly similar regions of homologous DNA. Training parameters using two submodels produces superior alignments, even when we align with only the parameters from the weaker submodel. Our extension algorithm combined with our new parameter set achieves sensitivity 0.59 on synthetic tests. In contrast, LASTZ with default settings achieves sensitivity 0.35 with the same false positive rate. Using the weak submodel as parameters for LASTZ increases its sensitivity to 0.59 with high error. FEAST is available at http://monod.uwaterloo.ca/feast/.
Keywords :
DNA; biology computing; genomics; molecular biophysics; statistical distributions; FEAST extension algorithm; Viterbi extension; gene subsequence; homologous DNA; human genomes; mouse genomes; probabilistic alignment model; statistical properties; Bioinformatics; Genomics; Hidden Markov models; Humans; Mice; Training; Viterbi algorithm; HMM; biology and genetics.; local alignment; sequence evolution; Algorithms; Animals; Artificial Intelligence; Computational Biology; DNA; Evolution, Molecular; Genome, Human; Humans; Markov Chains; Mice; Models, Genetic; ROC Curve; Sequence Alignment; Sequence Homology, Nucleic Acid; Software;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2010.76