Title :
Discovering Genes-Diseases Associations From Specialized Literature Using the Grid
Author :
Faro, Alberto ; Giordano, Daniela ; Maiorana, Francesco ; Spampinato, Concetto
Author_Institution :
Dipt. di Ing. Inf. e Telecomun., Univ. di Catania, Catania, Italy
fDate :
7/1/2009 12:00:00 AM
Abstract :
This paper proposes a novel method for text mining on the Grid, aimed at pointing out hidden relationships for hypothesis generation and suitable for semi-interactive querying. The method is based on unsupervised clustering and the outputs are visualized with contextual information. Grid implementation is crucial for feasibility. We demonstrate it with a mining run for discovering genes-diseases associations from bibliographic sources and annotated databases. The proposed methodology is in view of a Grid architecture specialized in bioinformatics mining tasks. Some performance considerations are provided.
Keywords :
bioinformatics; data mining; diseases; genetics; Grid architecture; bioinformatics; data mining; genes-disease associations; Genes-diseases association; Grid; knowledge discovery; text mining; unsupervised clustering; Algorithms; Cluster Analysis; Computational Biology; Cystic Fibrosis; Databases, Bibliographic; Disease; Genetic Predisposition to Disease;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2008.2007755