Title :
Mutual Information Optimization for Mass Spectra Data Alignment
Author :
Zoppis, I. ; Gianazza, E. ; Borsani, M. ; Chinello, C. ; Mainini, V. ; Galbusera, C. ; Ferrarese, C. ; Galimberti, G. ; Sorbi, S. ; Borroni, B. ; Magni, F. ; Antoniotti, M. ; Mauri, G.
Author_Institution :
Dept. of Inf., Univ. of Milano-Bicocca, Milan, Italy
Abstract :
"Signal” alignments play critical roles in many clinical setting. This is the case of mass spectrometry (MS) data, an important component of many types of proteomic analysis. A central problem occurs when one needs to integrate (MS) data produced by different sources, e.g., different equipment and/or laboratories. In these cases, some form of "data integration” or "data fusion” may be necessary in order to discard some source-specific aspects and improve the ability to perform a classification task such as inferring the "disease classes” of patients. The need for new high-performance data alignments methods is therefore particularly important in these contexts. In this paper, we propose an approach based both on an information theory perspective, generally used in a feature construction problem, and the application of a mathematical programming task (i.e., the weighted bipartite matching problem). We present the results of a competitive analysis of our method against other approaches. The analysis was conducted on data from plasma/ethylenediaminetetraacetic acid of "control” and Alzheimer patients collected from three different hospitals. The results point to a significant performance advantage of our method with respect to the competing ones tested.
Keywords :
data integration; diseases; mass spectroscopy; mathematical programming; medical computing; medical information systems; proteomics; sensor fusion; Alzheimer patients; competitive analysis; data fusion; data integration; disease classes; feature construction problem; high-performance data alignments methods; mass spectra data alignment; mass spectrometry data; mathematical programming task; mutual information optimization; plasma-ethylenediaminetetraacetic acid; proteomic analysis; Bioinformatics; Diseases; Mutual information; Optimization; Peptides; Power capacitors; Proteins; Optimization; data integration; graph algorithms.; information theory; medical informatics; medicine; proteomics; Alzheimer Disease; Biological Markers; Blood Proteins; Case-Control Studies; Databases, Protein; Humans; Information Theory; Mass Spectrometry; Proteome; Proteomics; Signal Transduction;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2011.80