DocumentCode
3730181
Title
Gene-disease association through topological and biological feature integration
Author
Eileen Marie Hanna;Nazar M. Zaki
Author_Institution
College of Information Technology, United Arab Emirates University, Al Ain 17551, UAE
fYear
2015
Firstpage
225
Lastpage
229
Abstract
The large amounts of biological information generated using advanced high-throughput experimental techniques continue to motivate the design of suitable methods for valuable knowledge mining. Finding proper means to examine and analyze such information allows better understanding of normal biological processes as well as uncovering malfunctions that trigger various diseases. Several computational approaches were developed to complement the experimental work which is often restricted by high time and cost requirements. In this paper, we consider the problem of disease- gene association and we propose a methodology based on a classification approach which integrates protein-protein interaction network topology features and biological information collected from various data sources. When applied on a dataset of multiple disease types and using the Naive Bayes classifier, our method achieves an AUC score of 0.941. We also consider two case studies of type II diabetes mellitus and breast cancer. The experimental results greatly favor our approach.
Keywords
"Diseases","Proteins","Diabetes","Information technology","Biological processes","Network topology"
Publisher
ieee
Conference_Titel
Innovations in Information Technology (IIT), 2015 11th International Conference on
Print_ISBN
978-1-4673-8509-1
Type
conf
DOI
10.1109/INNOVATIONS.2015.7381544
Filename
7381544
Link To Document