DocumentCode :
589121
Title :
Coupled Matrix Factorization with Sparse Factors to Identify Potential Biomarkers in Metabolomics
Author :
Acar, Esra ; Gurdeniz, G. ; Rasmussen, Morten A. ; Rago, D. ; Dragsted, L.O. ; Bro, Rasmus
Author_Institution :
Dept. of Food Sci., Univ. of Copenhagen, Copenhagen, Denmark
fYear :
2012
fDate :
10-10 Dec. 2012
Firstpage :
1
Lastpage :
8
Abstract :
Metabolomics focuses on the detection of chemical substances in biological fluids such as urine and blood using a number of analytical techniques including Nuclear Magnetic Resonance (NMR) spectroscopy and Liquid Chromatography-Mass Spectroscopy (LC-MS). Among the major challenges in analysis of metabolomics data are (i) joint analysis of data from multiple platforms and (ii) capturing easily interpretable underlying patterns, which could be further utilized for biomarker discovery. In order to address these challenges, we formulate joint analysis of data from multiple platforms as a coupled matrix factorization problem with sparsity constraints on the factor matrices. We develop an all-at-once optimization algorithm, called CMF-SPOPT (Coupled Matrix Factorization with SParse Optimization), which is a gradient-based optimization approach solving for all factor matrices simultaneously. Using numerical experiments on simulated data, we demonstrate that CMF-SPOPT can capture the underlying sparse patterns in data. Furthermore, on a real data set of blood samples collected from a group of rats, we use the proposed approach to jointly analyze metabolomic data sets and identify potential biomarkers for apple intake.
Keywords :
biochemistry; blood; chromatography; gradient methods; mass spectroscopy; matrix decomposition; nuclear magnetic resonance; optimisation; CMF-SPOPT; LC-MS; NMR spectroscopy; all-at-once optimization algorithm; apple intake; biological fluids; biomarker discovery; blood samples; chemical substance detection; coupled matrix factorization with sparse optimization; factor matrices; gradient-based optimization approach; interpretable underlying patterns; liquid chromatography-mass spectroscopy; metabolomic data analysis; nuclear magnetic resonance spectroscopy; numerical experiments; potential biomarker identification; rats; simulated data; sparsity constraints; Data models; Joints; Metabolomics; Nuclear magnetic resonance; Optimization; Sparse matrices; Coupled matrix factorization; gradient-based optimization; metabolomics; missing data; sparsity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-5164-5
Type :
conf
DOI :
10.1109/ICDMW.2012.17
Filename :
6406416
Link To Document :
بازگشت