DocumentCode :
81761
Title :
RANWAR: Rank-Based Weighted Association Rule Mining From Gene Expression and Methylation Data
Author :
Mallik, Saurav ; Mukhopadhyay, Anirban ; Maulik, Ujjwal
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
Volume :
14
Issue :
1
fYear :
2015
fDate :
Jan. 2015
Firstpage :
59
Lastpage :
66
Abstract :
Ranking of association rules is currently an interesting topic in data mining and bioinformatics. The huge number of evolved rules of items (or, genes) by association rule mining (ARM) algorithms makes confusion to the decision maker. In this article, we propose a weighted rule-mining technique (say, RANWAR or rank-based weighted association rule-mining) to rank the rules using two novel rule-interestingness measures, viz., rank-based weighted condensed support (wcs) and weighted condensed confidence (wcc) measures to bypass the problem. These measures are basically depended on the rank of items (genes). Using the rank, we assign weight to each item. RANWAR generates much less number of frequent itemsets than the state-of-the-art association rule mining algorithms. Thus, it saves time of execution of the algorithm. We run RANWAR on gene expression and methylation datasets. The genes of the top rules are biologically validated by Gene Ontologies (GOs) and KEGG pathway analyses. Many top ranked rules extracted from RANWAR that hold poor ranks in traditional Apriori, are highly biologically significant to the related diseases. Finally, the top rules evolved from RANWAR, that are not in Apriori, are reported.
Keywords :
bioinformatics; data mining; ontologies (artificial intelligence); KEGG pathway analyses; RANWAR; association rule mining algorithm; bioinformatics; data mining; decision maker; gene expression dataset; gene ontology; methylation dataset; rank-based weighted association rule mining; rank-based weighted condensed support measurement; rule-interestingness measurement; weighted condensed confidence measurement; weighted rule-mining technique; Association rules; Gene expression; Image color analysis; Itemsets; Radio access networks; $RANWAR$; $wcc$ ; $wcs$; Gene-ranking; Limma; gene-weight; weighted association rule mining;
fLanguage :
English
Journal_Title :
NanoBioscience, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1241
Type :
jour
DOI :
10.1109/TNB.2014.2359494
Filename :
6907995
Link To Document :
بازگشت