DocumentCode :
108056
Title :
Profile-Based LC-MS Data Alignment - A Bayesian Approach
Author :
Tsung-Heng Tsai ; Tadesse, Mahlet G. ; Yue Wang ; Ressom, Habtom W.
Author_Institution :
Dept. of Electr. & Comput. Eng., Georgetown Univ., Washington, DC, USA
Volume :
10
Issue :
2
fYear :
2013
fDate :
March-April 2013
Firstpage :
494
Lastpage :
503
Abstract :
A Bayesian alignment model (BAM) is proposed for alignment of liquid chromatography-mass spectrometry (LC-MS) data. BAM belongs to the category of profile-based approaches, which are composed of two major components: a prototype function and a set of mapping functions. Appropriate estimation of these functions is crucial for good alignment results. BAM uses Markov chain Monte Carlo (MCMC) methods to draw inference on the model parameters and improves on existing MCMC-based alignment methods through 1) the implementation of an efficient MCMC sampler and 2) an adaptive selection of knots. A block Metropolis-Hastings algorithm that mitigates the problem of the MCMC sampler getting stuck at local modes of the posterior distribution is used for the update of the mapping function coefficients. In addition, a stochastic search variable selection (SSVS) methodology is used to determine the number and positions of knots. We applied BAM to a simulated data set, an LC-MS proteomic data set, and two LC-MS metabolomic data sets, and compared its performance with the Bayesian hierarchical curve registration (BHCR) model, the dynamic time-warping (DTW) model, and the continuous profile model (CPM). The advantage of applying appropriate profile-based retention time correction prior to performing a feature-based approach is also demonstrated through the metabolomic data sets.
Keywords :
Bayes methods; Markov processes; Monte Carlo methods; biochemistry; bioinformatics; chromatography; inference mechanisms; mass spectroscopic chemical analysis; proteins; proteomics; statistical distributions; BHCR model; Bayesian alignment model; Bayesian hierarchical curve registration model; CPM; DTW model; LC-MS metabolomic data set; LC-MS proteomic data set; MCMC sampler; MCMC-based alignment method; Markov chain Monte Carlo method; SSVS methodology; block Metropolis-Hastings algorithm; continuous profile model; dynamic time-warping model; feature-based approach; inference mechanism; knot adaptive selection; liquid chromatography-mass spectrometry; mapping function coefficient; posterior distribution; profile-based LC-MS data alignment; profile-based retention time correction; prototype function; stochastic search variable selection methodology; Bayes methods; Chromatography-mass spectrometry; Monte Carlo methods; Stochastic processes; Alignment; Bayesian inference; Markov chain Monte Carlo (MCMC); block Metropolis-Hastings algorithm; liquid chromatography-mass spectrometry (LC-MS); stochastic search variable selection (SSVS); Algorithms; Bayes Theorem; Blood Proteins; Chromatography, Liquid; Computational Biology; Computer Simulation; Databases, Protein; Humans; Markov Chains; Mass Spectrometry; Metabolome; Monte Carlo Method;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2013.25
Filename :
6487487
Link To Document :
بازگشت