DocumentCode
472167
Title
Overlapping Probabilities of Top Ranking Gene Lists, Hypergeometric Distribution, and Stringency of Gene Selection Criterion
Author
Fury, Wen ; Batliwalla, Franak ; Gregersen, Peter K. ; Li, Wentian
Author_Institution
Regeneron Pharm., Tarrytown, NY
fYear
2006
fDate
Aug. 30 2006-Sept. 3 2006
Firstpage
5531
Lastpage
5534
Abstract
When the same set of genes appear in two top ranking gene lists in two different studies, it is often of interest to estimate the probability for this being a chance event. This overlapping probability is well known to follow the hypergeometric distribution. Usually, the lengths of top-ranking gene lists are assumed to be fixed, by using a pre-set criterion on, e.g., p-value for the t-test. We investigate how overlapping probability changes with the gene selection criterion, or simply, with the length of the top-ranking gene lists. It is concluded that overlapping probability is indeed a function of the gene list length, and its statistical significance should be quoted in the context of gene selection criterion
Keywords
genetics; molecular biophysics; statistical distributions; gene list length; gene selection criterion; hypergeometric distribution; overlapping probability; statistical analysis; Bioinformatics; Cities and towns; Diseases; Medical treatment; Ontologies; Pharmaceuticals; Probability; Proteins; Testing; USA Councils;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society, 2006. EMBS '06. 28th Annual International Conference of the IEEE
Conference_Location
New York, NY
ISSN
1557-170X
Print_ISBN
1-4244-0032-5
Electronic_ISBN
1557-170X
Type
conf
DOI
10.1109/IEMBS.2006.260828
Filename
4463058
Link To Document