• DocumentCode
    260326
  • Title

    Gene Networks Inference through Linear Grouping of Variables

  • Author

    Montoya-Cubas, Carlos Fernando ; Correa Martins, David ; Silva Santos, Carlos ; Barrera, Junior

  • Author_Institution
    Center of Math., Comput. & Cognition, Fed. Univ. of ABC, Santo Andre, Brazil
  • fYear
    2014
  • fDate
    10-12 Nov. 2014
  • Firstpage
    243
  • Lastpage
    250
  • Abstract
    The inference of gene networks from gene expression data is an open problem due to the large dimensionality (number of genes) and the small number of data samples typically available, even considering the fact that the network is sparse (limited number of input genes per target gene). In this work we propose a method that alleviates the curse of dimensionality by grouping predictor gene configurations in their respective linear combination values. Each linear combination value results in an equivalence class. In this way, the number of configurations of predictor values becomes a linear function of the dimensionality (number of predictors) instead of an exponential function when considering the original configurations. The proposed method follows the probabilistic gene networks approach which applies local feature selection to obtain an adequate predictor gene set for each gene. Even considering that some information from the original configurations of predictors is lost after applying the grouping, the results indicate that the inference with linear grouping tends to provide networks with better topological similarities than those obtained without grouping in cases where the number of samples is quite limited and the inference involves a larger number of predictors per gene.
  • Keywords
    bioinformatics; feature selection; genetics; genomics; probability; data samples; dimensionality; equivalence class; exponential function; gene expression data; gene network inference; linear function; linear variable grouping; local feature selection; open problem; original configurations; predictor gene configurations; predictor gene set; predictor values; probabilistic gene network approach; respective linear combination values; topological similarities; Barium; Biological system modeling; Erbium; Estimation; Gene expression; Probabilistic logic; Vectors; dimensionality reduction; feature selection; gene networks inference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2014 IEEE International Conference on
  • Conference_Location
    Boca Raton, FL
  • Type

    conf

  • DOI
    10.1109/BIBE.2014.10
  • Filename
    7033588