Title :
A POCS-based method for estimating unobserved values in microarray time-series data
Author :
Zeng, Jia ; Yan, Hong
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Hong Kong
Abstract :
This paper presents a POCS-based (projection on convex set) method that estimates the unobserved time-points in microarray time-series data to make such data useful for clustering and aligning. Unobserved values are caused either by missing values or by unevenly sampling rates, and cannot be estimated accurately by straightforward interpolation due to very noisy and few replicated data. According to prior knowledge that each gene time-series is constrained in both time and frequency domains, POCS formulates these constraints by multiple convex sets and uses an iteratively convergent procedure to find the optimal value that satisfies all constraints by prior knowledge. To estimate the unobserved values, we use the cubic spline method to estimate the initial value and use POCS to find the optimal value iteratively. We show that POCS can improve the estimation of unobserved time-points with lower normalized root mean squared error compared with the statistical spline estimation for the continuous representation of microarray time-series data. Theoretically, the POCS-based method may improve the estimation performance further if more prior knowledge is available.
Keywords :
interpolation; pattern clustering; time series; cubic spline method; microarray time-series data; multiple convex sets; projection on convex set method; statistical spline estimation; straightforward interpolation; Data analysis; Data engineering; Frequency domain analysis; Frequency synchronization; Gene expression; Image reconstruction; Machine learning; Sampling methods; Spline; Time series analysis; Microarray time-series data; Missing values; Projection on convex set (POCS); Unevenly sampling rates;
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4621084