DocumentCode
2413471
Title
A two-stage machine learning approach for pathway analysis
Author
Zhang, Wei ; Emrich, Scott ; Zeng, Erliang
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Notre Dame, Notre Dame, IN, USA
fYear
2010
fDate
18-21 Dec. 2010
Firstpage
274
Lastpage
279
Abstract
Analysis of gene expression data has emerged as an important approach to discover active pathways related to biological phenotypes. Previous pathway analysis methods use all genes in a pathway for linking it to a particular phenotype. Using only a subset of informative genes, however, could better classify samples. Here, we propose a two-stage machine learning approach for pathway analysis. During the first stage, informative genes that can represent a pathway are selected using feature selection methods. These “representative genes” are mostly associated with the phenotype of interest. In the second stage, pathways are ranked based on their “representative genes” using classification methods. We applied our two-stage approach on three gene expression datasets. The results indicate our method does outperform methods that consider every gene in a pathway.
Keywords
bioinformatics; biological techniques; cellular biophysics; feature extraction; genetics; learning (artificial intelligence); pattern classification; active pathway discovery; biological phenotypes; classification methods; feature selection methods; gene expression data analysis; informative genes; pathway analysis; two stage machine learning approach; Bioinformatics; Breast cancer; Error analysis; Gene expression; Machine learning; Redundancy; Support vector machines; Classification; Feature Selection; Gene expression; Machine Learning; Pathway Analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-8306-8
Electronic_ISBN
978-1-4244-8307-5
Type
conf
DOI
10.1109/BIBM.2010.5706576
Filename
5706576
Link To Document