DocumentCode
667280
Title
Inference of a robust diagnostic signature in the case of Melanoma: Gene selection by information gain and Gene Ontology tree exploration
Author
Valavanis, Ioannis ; Moutselos, Konstantinos ; Maglogiannis, Ilias ; Chatziioannou, Aristotelis A.
Author_Institution
Inst. of Biol., Medicinal Chem. & Biotechnol., Nat. Hellenic Res. Found., Athens, Greece
fYear
2013
fDate
10-13 Nov. 2013
Firstpage
1
Lastpage
4
Abstract
Integrated datasets originating from multi-modal data can be used towards the identification of causal biological actions that through a systems level process trigger the development of a disease. We use, here, an integrated dataset related to cutaneous melanoma that comes from two separate sets (microarray and imaging) and the application of data imputation methods. Our goal is to select a subset of genes that comprise candidate biomarkers and compare these to imaging features, that characterize disease at a macroscopic level. Using information gain ratio measurements and exploration of Gene Ontology (GO) tree, we identified a set of 33 genes both highly correlated to the disease status and with a central role in regulatory mechanisms. Selected genes were used to train various classifiers that could generalize well when discriminating malignant from benign melanoma samples. Results showed that classifiers performed better when selected genes were used as input, rather than imaging features selected by information gain measurements. Thus, genes in the backstage of low-level biological processes showed to carry higher information content than the macroscopic imaging features.
Keywords
biology computing; decision trees; diseases; feature selection; genetics; inference mechanisms; ontologies (artificial intelligence); pattern classification; GO tree; benign melanoma sample discrimination; biomarkers; causal biological action identification; classifier training; cutaneous melanoma; data imputation methods; disease characterization; gene ontology tree; gene subset selection; imaging features; imaging sets; inference; information content; information gain ratio measurements; integrated datasets; low-level biological processes; macroscopic level; malignant melanoma sample discrimination; microarray sets; multimodal data; regulatory mechanisms; robust diagnostic signature; Cancer; Diseases; Gene expression; Imaging; Malignant tumors; Sensitivity; Skin;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
Conference_Location
Chania
Type
conf
DOI
10.1109/BIBE.2013.6701618
Filename
6701618
Link To Document