DocumentCode :
463463
Title :
Near-Lossless Compression of Mass Spectra for Proteomics
Author :
Miguel, A.C. ; Kearney-Fischer, M. ; Keane, J.F. ; Whiteaker, J. ; Li-Chia Feng ; Paulovich, A.
Author_Institution :
Dept. of Electr. & Comput. Eng., Seattle Univ., WA, USA
Volume :
1
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Recent improvements in mass spectrometry (MS) technology led to an explosive amount of MS data collected and shared. A typical liquid chromatography/mass spectrometry (LC/MS) "image" from the instrument used in this study consists of 4 GB of data. To reduce the bit rate required to code the MS data below that of the authors\´ previous (lossless) algorithm, we introduce a technique for near-lossless compression. It guarantees that each decompressed sample differs from its original value by no more than a user-specified quantity defined as the target maximum absolute distortion (MAD). We evaluate the proposed method by introducing feature-based metrics applied to the decompressed MS data and show that the MAD-based compression outperforms a traditional coding algorithm aimed at minimizing the mean squared error.
Keywords :
data compression; distortion; genetics; image coding; mass spectra; mean square error methods; medical image processing; proteins; liquid chromatography; mass spectra; mass spectrometry technology; maximum absolute distortion; mean squared error; near-lossless compression; proteomics; user-specified quantity; Biological information theory; Cancer; Data compression; Genomics; Image coding; Mass spectroscopy; Peptides; Proteins; Proteomics; Sequences; data compression; distortion; image coding; spectroscopy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366693
Filename :
4217093
Link To Document :
بازگشت