DocumentCode
3542635
Title
Joint corresponding feature identification and alignment for multiple LC/MS replicates
Author
Cui, Jian ; Ma, Xuepo ; Zhang, Jianqiu
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Texas at San Antonio, San Antonio, TX, USA
fYear
2011
fDate
4-6 Dec. 2011
Firstpage
108
Lastpage
111
Abstract
In Liquid Chromatography/Mass Spectrometry (LC-MS), identifying corresponding peptide features (LC peaks) in multiple replicate datasets plays a crucial role in the differential analysis of complex peptide or protein samples for biomarker discovery. Given a peptide sequence, we aim at identifying its LC peak intervals in all datasets simultaneously. Generally, features are first identified in each replicate dataset, and then the features are aligned using warping functions. In such a procedure, the error in feature identification will propagate to alignment. Instead, we consider the problem of joint feature identification and alignment in multiple datasets. Since accurate feature identification improves the accuracy of corresponding feature alignment and vice versa, joint processing provides better performance than separate processing. We propose an algorithm which combines peak identification quality scores, time shifts and the similarity of LC peak shapes between candidate corresponding features for accurate alignment. In addition, we also incorporate the approximate elution time interval of a peptide stored in an Accurate Time and Mass (ATM) database when available. We test our algorithm on publicly available datasets, and we compare its with that of separate feature identification and alignment. Results show that the number of accurately identified corresponding features is improved significantly by using the proposed method.
Keywords
bioinformatics; chromatography; computerised instrumentation; mass spectroscopy; proteins; LC peak intervals; LC peak shape similarity; accurate time-and-mass database; approximate elution time interval; biomarker discovery; complex peptide differential analysis; joint corresponding feature alignment; joint corresponding feature identification; liquid chromatography-mass spectrometry; multiple LC-MS replicates; multiple replicate datasets; peak identification quality scores; peptide feature identification; protein sample differential analysis; time shifts; warping functions; Accuracy; Databases; Feature extraction; Histograms; Joints; Peptides; Shape;
fLanguage
English
Publisher
ieee
Conference_Titel
Genomic Signal Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on
Conference_Location
San Antonio, TX
ISSN
2150-3001
Print_ISBN
978-1-4673-0491-7
Electronic_ISBN
2150-3001
Type
conf
DOI
10.1109/GENSiPS.2011.6169456
Filename
6169456
Link To Document