DocumentCode :
104462
Title :
A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data
Author :
Bandyopadhyay, Supriyo ; Mallik, S. ; Mukhopadhyay, Amit
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
Volume :
11
Issue :
1
fYear :
2014
fDate :
Jan.-Feb. 2014
Firstpage :
95
Lastpage :
115
Abstract :
DNA microarray is a powerful technology that can simultaneously determine the levels of thousands of transcripts (generated, for example, from genes/miRNAs) across different experimental conditions or tissue samples. The motto of differential expression analysis is to identify the transcripts whose expressions change significantly across different types of samples or experimental conditions. A number of statistical testing methods are available for this purpose. In this paper, we provide a comprehensive survey on different parametric and non-parametric testing methodologies for identifying differential expression from microarray data sets. The performances of the different testing methods have been compared based on some real-life miRNA and mRNA expression data sets. For validating the resulting differentially expressed miRNAs, the outcomes of each test are checked with the information available for miRNA in the standard miRNA database PhenomiR 2.0. Subsequently, we have prepared different simulated data sets of different sample sizes (from 10 to 100 per group/population) and thereafter the power of each test have been calculated individually. The comparative simulated study might lead to formulate robust and comprehensive judgements about the performance of each test in the basis of assumption of data distribution. Finally, a list of advantages and limitations of the different statistical tests has been provided, along with indications of some areas where further studies are required.
Keywords :
DNA; RNA; biological tissues; lab-on-a-chip; molecular biophysics; statistical analysis; DNA microarray data sets; data distribution; differential expression analysis; genes; mRNA expression data sets; miRNA expression data sets; nonparametric testing methodology; standard miRNA database PhenomiR 2.0; statistical testing; statistical testing methods; tissue samples; Analysis of variance; Computational biology; Correlation; Sociology; Standards; Testing; Differentially expressed transcripts; multiple testing corrections; parametric and nonparametric tests; power of test;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2013.147
Filename :
6671571
Link To Document :
بازگشت