Title :
Transactional Database Transformation and Its Application in Prioritizing Human Disease Genes
Author :
Yang Xiang ; Payne, P.R.O. ; Kun Huang
Author_Institution :
Dept. of Biomed. Inf., Ohio State Univ., Columbus, OH, USA
Abstract :
Binary (0,1) matrices, commonly known as transactional databases, can represent many application data, including gene-phenotype data where "1” represents a confirmed gene-phenotype relation and "0” represents an unknown relation. It is natural to ask what information is hidden behind these "0”s and "1”s. Unfortunately, recent matrix completion methods, though very effective in many cases, are less likely to infer something interesting from these (0,1)-matrices. To answer this challenge, we propose Ind Evi, a very succinct and effective algorithm to perform independent-evidence-based transactional database transformation. Each entry of a (0,1)-matrix is evaluated by "independent evidence” (maximal supporting patterns) extracted from the whole matrix for this entry. The value of an entry, regardless of its value as 0 or 1, has completely no effect for its independent evidence. The experiment on a gene-phenotype database shows that our method is highly promising in ranking candidate genes and predicting unknown disease genes.
Keywords :
bioinformatics; diseases; genetics; binary (0,1) matrices; gene-phenotype database; human disease genes; independent evidence; matrix completion methods; maximal supporting patterns; transactional database transformation; Bioinformatics; Bipartite graph; Data mining; Diseases; Feature extraction; Itemsets; Transactional database; binary matrix; disease gene; frequent item set mining; matrix completion.; maximal biclique; phenotype; prioritization; Computational Biology; Databases, Genetic; Disease; Humans; Models, Genetic; Phenotype;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2011.58