Title :
A link prediction based unsupervised rank aggregation algorithm for informative gene selection
Author :
Li, Kang ; Du, Nan ; Zhang, Aidong
Author_Institution :
Dept. of Comput. Sci. & Eng., State Univ. of New York at Buffalo, Buffalo, NY, USA
Abstract :
Informative gene selection is the process of identifying relevant genes that are significantly and differentially expressed in biological processes. The microarray experiments conducted for this purpose typically use fewer than a hundred samples to rank the relevance of thousands of genes. Many irrelevant genes may therefore gain statistical importance through the randomness caused by the small sample problem, while relevant genes may lose importance in the same way. Overcoming this problem goes beyond what a single microarray dataset can offer and calls for combining the results of multiple experiments, a task known as rank aggregation. In this paper, we propose a novel link prediction based rank aggregation algorithm for informative gene selection. Each ranking is transformed into a fully connected, weighted network in which the nodes represent genes and the link weights represent the priorities between connected nodes (genes). The integration of multiple gene rankings is then formulated as an optimization problem of link prediction on multiple networks, with a criterion function that favors maximizing the weighted consensus among the networks. We solve the problem by iterating between estimating the weights and maximizing the consensus among the networks. In the experimental evaluation, we demonstrate our method on the Prostate Cancer Dataset and compare it with baseline methods. The results show that our link prediction based rank aggregation method outperforms all of the compared methods, indicating the effectiveness of our framework in finding informative genes from multiple microarray experimental results.
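The abstract gives no formulas, so the following Python sketch only illustrates the general idea of turning each ranking into a pairwise-preference network and alternating between a weighted consensus and per-ranking weights. It is not the authors' algorithm: the 0/1 link weights, the squared-distance measure, the `1 / (1 + distance)` reweighting rule, and the names `rank_to_network` and `aggregate` are all assumptions made for illustration.

```python
def rank_to_network(ranking, genes):
    """Fully connected network as a matrix: entry [i][j] is 1.0 when
    genes[i] is ranked ahead of genes[j] (assumed 0/1 link weights)."""
    pos = {g: p for p, g in enumerate(ranking)}
    n = len(genes)
    return [[1.0 if pos[genes[i]] < pos[genes[j]] else 0.0
             for j in range(n)] for i in range(n)]


def aggregate(rankings, n_iter=20):
    """Alternate between (1) a weighted consensus network and
    (2) reweighting each ranking by its closeness to that consensus.
    Returns (aggregated ranking, final ranking weights)."""
    genes = sorted(rankings[0])
    n = len(genes)
    nets = [rank_to_network(r, genes) for r in rankings]
    weights = [1.0 / len(nets)] * len(nets)  # start with equal trust

    for _ in range(n_iter):
        # Consensus link weight = weighted average over all networks.
        consensus = [[sum(w * net[i][j] for w, net in zip(weights, nets))
                      for j in range(n)] for i in range(n)]
        # Squared distance of each network from the consensus.
        dists = [sum((net[i][j] - consensus[i][j]) ** 2
                     for i in range(n) for j in range(n)) for net in nets]
        # Assumed rule: rankings closer to the consensus get more weight.
        raw = [1.0 / (1.0 + d) for d in dists]
        total = sum(raw)
        weights = [r / total for r in raw]

    # Final consensus under the last weights; score = total outgoing priority.
    consensus = [[sum(w * net[i][j] for w, net in zip(weights, nets))
                  for j in range(n)] for i in range(n)]
    scores = {genes[i]: sum(consensus[i]) for i in range(n)}
    ranked = sorted(genes, key=lambda g: (-scores[g], g))
    return ranked, weights


# Toy input: three broadly consistent rankings and one reversed outlier.
rankings = [
    ["A", "B", "C", "D"],
    ["A", "B", "D", "C"],
    ["A", "C", "B", "D"],
    ["D", "C", "B", "A"],
]
ranked, weights = aggregate(rankings)
print(ranked)
print([round(w, 3) for w in weights])
```

In this toy input the reversed ranking sits farthest from the consensus, so under the assumed reweighting rule it ends up with the smallest weight and has little influence on the aggregated order.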
Keywords :
bioinformatics; cancer; complex networks; genetics; iterative methods; medical computing; molecular biophysics; optimisation; connected node priorities; criterion function; fully connected weighted network; informative gene selection; iterative weight estimation; link prediction based rank aggregation algorithm; link prediction optimization problem; microarray experiments; multiple gene ranks; prostate cancer dataset; relevant gene identification; small sample problem; unsupervised rank aggregation algorithm; weighted consensus maximization; Estimation; Linear programming; Measurement; Mutual information; Optimization; Predictive models; Prostate cancer; Informative Gene Selection; Link Prediction; Rank Aggregation;
Conference_Title :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
DOI :
10.1109/BIBM.2012.6392697