DocumentCode
3198756
Title
An ensemble rank learning approach for gene prioritization
Author
Po-Feng Lee ; Von-Wun Soo
Author_Institution
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fYear
2013
fDate
3-7 July 2013
Firstpage
3507
Lastpage
3510
Abstract
Several different computational approaches have been developed to solve the gene prioritization problem. We intend to use the ensemble boosting learning techniques to combine variant computational approaches for gene prioritization in order to improve the overall performance. In particular we add a heuristic weighting function to the Rankboost algorithm according to: 1) the absolute ranks generated by the adopted methods for a certain gene, and 2) the ranking relationship between all gene-pairs from each prioritization result. We select 13 known prostate cancer genes in OMIM database as training set and protein coding gene data in HGNC database as test set. We adopt the leave-one-out strategy for the ensemble rank boosting learning. The experimental results show that our ensemble learning approach outperforms the four gene-prioritization methods in ToppGene suite in the ranking results of the 13 known genes in terms of mean average precision, ROC and AUC measures.
Keywords
cancer; genetics; genomics; heuristic programming; learning (artificial intelligence); medical computing; proteins; AUC measure; HGNC database; OMIM database; ROC measure; Rankboost algorithm; ToppGene suite; absolute ranks; ensemble rank boosting learning techniques; gene-pairs; gene-prioritization method; heuristic weighting function; leave-one-out strategy; mean average precision; prostate cancer genes; protein coding gene data; ranking relationship; training set; variant computational approaches; Algorithm design and analysis; Bioinformatics; Boosting; Databases; Diseases; Markov processes; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE
Conference_Location
Osaka
ISSN
1557-170X
Type
conf
DOI
10.1109/EMBC.2013.6610298
Filename
6610298
Link To Document