Title of article
Zipfʹs Law in Importance of Genes for Cancer Classification Using Microarray Data
Author/Authors
LI، نويسنده , , WENTIAN and YANG، نويسنده , , YANING، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2002
Pages
13
From page
539
To page
551
Abstract
Using a measure of how differentially expressed a gene is in two biochemically/phenotypically different conditions, we can rank all genes in a microarray dataset. We have shown that the falling-off of this measure (normalized maximum likelihood in a classification model such as logistic regression) as a function of the rank is typically a power-law function. This power-law function in other similar ranked plots are known as the Zipfʹs law, observed in many natural and social phenomena. The presence of this power-law function prevents an intrinsic cutoff point between the “important” genes and “irrelevant” genes. We have shown that similar power-law functions are also present in permuted dataset, and provide an explanation from the well-known χ2 distribution of likelihood ratios. We discuss the implication of this Zipfʹs law on gene selection in a microarray data analysis, as well as other characterizations of the ranked likelihood plots such as the rate of fall-off of the likelihood.
Journal title
Journal of Theoretical Biology
Serial Year
2002
Journal title
Journal of Theoretical Biology
Record number
1535580
Link To Document