DocumentCode :
680177
Title :
Fast Westfall-Young permutation procedure for combinatorial regulation discovery
Author :
Terada, Asako ; Tsuda, Kazuhiko ; Sese, J.
Author_Institution :
Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
153
Lastpage :
158
Abstract :
Three or more transcription factors (TFs) often work together, and the combinatorial regulations are essential in cellular machinery. However, it is impossible to discover statistically significant sets of TF binding motifs due to the necessity of the multiple testing procedure. To improve the sensitivity of widely used Bonferroni correction or its modified methods, such as Holm procedure, Westfall-Young permutation procedure (WY-procedure) has often been applied. However, few studies have used WY-procedure for the discoveries of the combinatorial effects of the motifs because of the extremely large computational time. In this paper, we propose an efficient branch-and-bound algorithm to perform WY-procedure to enumerate statistically significant motif combinations. When we use WY-procedure for the combinatorial regulation discovery, finding the minimum P-value from each permuted dataset consumes an enormous amount of time. We show that a combination that has the possibility to achieve the minimum P-value appears with high frequency over the threshold in dataset. This property enables a frequent itemset mining algorithm to efficiently select the candidates to achieve the minimum P-value. Our demonstrations using yeast and human transcriptome datasets show that the proposed algorithm is orders-of-magnitude faster than WY-procedure, and can practically list statistically significant motif combinations even when any combinations are considered.
Keywords :
biology computing; cellular biophysics; combinatorial mathematics; data mining; genetics; genomics; statistical analysis; Bonferroni correction; Fast Westfall-Young permutation procedure; Holm procedure; TF binding motifs; WY-procedure; branch-and-bound algorithm; cellular machinery; combinatorial motif effect discoveries; combinatorial regulation discovery; combinatorial regulations; frequent itemset mining algorithm; human transcriptome datasets; large computational time; minimum P-value; modified methods; multiple testing procedure; orders-of-magnitude; permuted dataset; statistically significant motif combinations; transcription factors; yeast transcriptome datasets; Acceleration; Gene expression; Itemsets; Sensitivity; Testing; Time-frequency analysis; Combinatorial Regulation; Gene Expression; Motif Combination; Multiple Test; Westfall-Young Permutation Procedure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
Type :
conf
DOI :
10.1109/BIBM.2013.6732479
Filename :
6732479
Link To Document :
بازگشت