Title :
A Markov Chain Monte Carlo Sampling Relevance Vector Machine Model for Recognizing Transcription Start Sites
Author :
Juncai, Huang ; Fengbi, Wang ; Huanzhang, Mao ; Mingtian, Zhou
Author_Institution :
Coll. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Abstract :
The task of finding transcription start sites (TSSs) can be modeled as a classification problem. Relevance vector machines (RVM) is a family of machine learning methods that represent a Bayesian approach to the training of general linear models (GLM). Based on the Markov-chain Monte Carlo(MCMC) sampler, propose a model for using the RVM to explore very large numbers of candidate features. The model applyes the power of the RVM to classifying and detecting interesting points and regions in biological sequence data. The model has been used successfully for testing predicting transcription start sites and other features in genome sequences. Our experimental results on real nucleotide sequences data show that our method improve the prediction accuracy greatly and our method performs significantly better than Promoter Inspector and CpG islands.
Keywords :
Markov processes; Monte Carlo methods; belief networks; genomics; learning (artificial intelligence); pattern classification; sampling methods; support vector machines; Bayesian approach; Markov chain; Monte Carlo sampling; general linear model; genome sequence; machine learning method; relevance vector machine model; transcription start sites recognization; Bioinformatics; Biological system modeling; DNA; Data models; Genomics; Markov processes; Training; Markov-chain Monte Carlo sampler; Relevance vector machines; candidate feature; recognizing TSSs; style;
Conference_Titel :
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-8432-4
DOI :
10.1109/AICI.2010.277