Title :
Gene Association Networks from Microarray Data Using a Regularized Estimation of Partial Correlation Based on PLS Regression
Author :
Tenenhaus, Arthur ; Guillemot, Vincent ; Gidrol, Xavier ; Frouin, Vincent
Author_Institution :
Lab. d´´Exploration Fonctionnelle des Genomes, Inst. de Radiobiologie Cellulaire et Moleculaire (iRCM), Evry, France
Abstract :
Reconstruction of gene-gene interactions from large-scale data such as microarrays is a first step toward better understanding the mechanisms at work in the cell. Two main issues have to be managed in such a context: 1) choosing which measures have to be used to distinguish between direct and indirect interactions from high-dimensional microarray data and 2) constructing networks with a low proportion of false-positive edges. We present an efficient methodology for the reconstruction of gene interaction networks in a small-sample-size setting. The strength of independence of any two genes is measured, in such "high-dimensional network," by a regularized estimation of partial correlation based on Partial Least Squares Regression. We finally emphasize specific properties of the proposed method. To assess the sensitivity and specificity of the method, we carried out the reconstruction of networks from simulated data. We also tested PLS-based partial correlation network on static and dynamic real microarray data. An R implementation of the proposed algorithm is available from http://biodev.extra.cea.fr/plspcnetwork/.
Keywords :
biology computing; cellular biophysics; genetics; molecular biophysics; gene association networks; gene interaction networks; gene-gene interactions; high-dimensional microarray data; partial correlation regularized estimation; partial least squares regression; Gene Association Networks; Gene association networks; Microarray; Partial Correlation; Partial Least Squares; Partial Least Squares Regression; high-dimensional data; local false discovery rate.; partial correlation; Algorithms; Area Under Curve; Computational Biology; Computer Simulation; Databases, Genetic; Escherichia coli; Gene Regulatory Networks; Genes, Bacterial; Genes, myc; Humans; Least-Squares Analysis; Oligonucleotide Array Sequence Analysis; Reproducibility of Results; Sensitivity and Specificity; Signal Transduction; T-Lymphocytes;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2008.87