• DocumentCode
    472167
  • Title

    Overlapping Probabilities of Top Ranking Gene Lists, Hypergeometric Distribution, and Stringency of Gene Selection Criterion

  • Author

    Fury, Wen ; Batliwalla, Franak ; Gregersen, Peter K. ; Li, Wentian

  • Author_Institution
    Regeneron Pharm., Tarrytown, NY
  • fYear
    2006
  • fDate
    Aug. 30 2006-Sept. 3 2006
  • Firstpage
    5531
  • Lastpage
    5534
  • Abstract
    When the same set of genes appear in two top ranking gene lists in two different studies, it is often of interest to estimate the probability for this being a chance event. This overlapping probability is well known to follow the hypergeometric distribution. Usually, the lengths of top-ranking gene lists are assumed to be fixed, by using a pre-set criterion on, e.g., p-value for the t-test. We investigate how overlapping probability changes with the gene selection criterion, or simply, with the length of the top-ranking gene lists. It is concluded that overlapping probability is indeed a function of the gene list length, and its statistical significance should be quoted in the context of gene selection criterion
  • Keywords
    genetics; molecular biophysics; statistical distributions; gene list length; gene selection criterion; hypergeometric distribution; overlapping probability; statistical analysis; Bioinformatics; Cities and towns; Diseases; Medical treatment; Ontologies; Pharmaceuticals; Probability; Proteins; Testing; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society, 2006. EMBS '06. 28th Annual International Conference of the IEEE
  • Conference_Location
    New York, NY
  • ISSN
    1557-170X
  • Print_ISBN
    1-4244-0032-5
  • Electronic_ISBN
    1557-170X
  • Type

    conf

  • DOI
    10.1109/IEMBS.2006.260828
  • Filename
    4463058