DocumentCode :
2909412
Title :
Evolutionary metropolis sampling in sequence alignment space
Author :
Bi, Chengpeng
Author_Institution :
Div. of Clinical Pharmacology, Children´´s Mercy Hosp., Kansas City, MO
fYear :
2008
fDate :
1-6 June 2008
Firstpage :
189
Lastpage :
194
Abstract :
Metropolis sampling is the earliest Markov chain Monte Carlo (MCMC) method and MCMC has been widely used in motif-finding via sequence local alignment. A key issue in the design of MCMC algorithms is to improve the proposal mechanism and the mixing behavior. To overcome these difficulties, it is common either to run a population of chains or incorporate the evolutionary computing techniques into the MCMC framework. This paper combines a simple evolutionary (genetic) algorithm (GA) with the metropolis sampler and proposes the new motif algorithm GAMS to carry out motif heuristic search throughout the multiple alignment space. GAMS first initializes a population of multiple local alignments (initial MCMC chains) each of which is encoded on a chromosome that represents a potential solution. GAMS then conducts a genetic algorithm-based search in the sequence alignment space using a motif scoring function as the fitness measure. The genetic algorithm gradually moves this population towards the best alignment from which the motif model is derived. Experimental results show that the new algorithm compares favorably to the simple multiple-run MCMC in some difficult cases, and it also exhibits higher precision than some popular motif-finding algorithms while testing on the annotated prokaryotic and eukaryotic binding sites data sets.
Keywords :
Markov processes; Monte Carlo methods; genetic algorithms; sampling methods; sequences; Markov chain Monte Carlo method; eukaryotic binding sites data sets; evolutionary computing techniques; evolutionary metropolis sampling; genetic algorithm; motif heuristic search; motif scoring function; motif-finding; prokaryotic binding sites data sets; sequence local alignment space; Bioinformatics; Bismuth; DNA; Gene expression; Genetics; Genomics; Machinery; Proteins; Sampling methods; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1822-0
Electronic_ISBN :
978-1-4244-1823-7
Type :
conf
DOI :
10.1109/CEC.2008.4630797
Filename :
4630797
Link To Document :
بازگشت