DocumentCode :
2682493
Title :
The effects of pre-processing and parameter choices on searches through large gene expression data collections
Author :
Hibbs, Matthew A.
Author_Institution :
Jackson Lab., Bar Harbor, ME, USA
fYear :
2009
fDate :
17-21 May 2009
Firstpage :
1
Lastpage :
4
Abstract :
Gene expression microarray data collections contain information that can shed light on a variety of systems-level biological problems, including the functional roles of proteins and the regulatory networks governing their transcription and translation. However, the analysis of these data is complicated by unusual noise characteristics and variation between experimental protocols and technologies. Many of the efforts to confront these difficulties utilize additional pre-processing strategies to adjust the input data and/or alter parameter choices of their algorithmic approach. Here, we examine the effect of some of these techniques in the context of the SPELL similarity search algorithm. Our results demonstrate that pre-processing and parameter choices can greatly affect the performance of this approach. As such, these choices should be carefully considered and evaluated when performing a broad range of analyses of gene expression data.
Keywords :
bioinformatics; data analysis; genetics; proteins; search problems; SPELL similarity search algorithm; gene expression data collection; gene expression microarray data; noise characteristics; protein function; regulatory networks; systems-level biological problems; Bioinformatics; Data analysis; Gaussian distribution; Gene expression; Genomics; Laboratories; Performance evaluation; Proteins; Protocols; Systematics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics, 2009. GENSIPS 2009. IEEE International Workshop on
Conference_Location :
Minneapolis, MN
Print_ISBN :
978-1-4244-4761-9
Electronic_ISBN :
978-1-4244-4762-6
Type :
conf
DOI :
10.1109/GENSIPS.2009.5174357
Filename :
5174357
Link To Document :
بازگشت