DocumentCode
3714374
Title
Imputation of ChIP-seq datasets via Low Rank Convex Co-Embedding
Author
Lin Zhu; Wei-Li Guo; De-Shuang Huang; Can-Yi Lu
Author_Institution
College of Electronics and Information Engineering, Tongji University, Shanghai, China
fYear
2015
Firstpage
141
Lastpage
144
Abstract
In recent years, thanks to the efforts of individual scientists and research consortiums, a huge amount of chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) experimental data have been accumulated. Although several recent studies have demonstrated that a wealth of insights can be gained by integrative analysis of these data, owing to cost, time or sample material availability, it is not always possible for researchers to obtain binding profiles for every proteins in every sample of interest, which considerably limits the power of integrative studies. In this paper, we propose a novel method called Low Rank Convex Co-Embedding (LRCCE) for imputing new ChIP-seq datasets. In LRCCE, a diverse collection of available ChIP-seq data are fused together by mapping proteins, samples, and genomic positions simultaneously into the Euclidean space, thereby making their underling associations directly evaluable using simple calculations. In contrast with previous approaches which mainly use of the local correlations between available datasets, LRCCE can better estimate the overall data structure by formulating the representation learning of all involved entities as a single unified optimization problem. Experimental evaluations on the ENCODE data illustrate the usefulness of the proposed model.
Keywords
"Genomics","Bioinformatics","Sequential analysis","Proteins","Manganese"
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/BIBM.2015.7359671
Filename
7359671
Link To Document