DocumentCode
2682493
Title
The effects of pre-processing and parameter choices on searches through large gene expression data collections
Author
Hibbs, Matthew A.
Author_Institution
Jackson Lab., Bar Harbor, ME, USA
fYear
2009
fDate
17-21 May 2009
Firstpage
1
Lastpage
4
Abstract
Gene expression microarray data collections contain information that can shed light on a variety of systems-level biological problems, including the functional roles of proteins and the regulatory networks governing their transcription and translation. However, the analysis of these data is complicated by unusual noise characteristics and variation between experimental protocols and technologies. Many of the efforts to confront these difficulties utilize additional pre-processing strategies to adjust the input data and/or alter parameter choices of their algorithmic approach. Here, we examine the effect of some of these techniques in the context of the SPELL similarity search algorithm. Our results demonstrate that pre-processing and parameter choices can greatly affect the performance of this approach. As such, these choices should be carefully considered and evaluated when performing a broad range of analyses of gene expression data.
Keywords
bioinformatics; data analysis; genetics; proteins; search problems; SPELL similarity search algorithm; gene expression data collection; gene expression microarray data; noise characteristics; protein function; regulatory networks; systems-level biological problems; Bioinformatics; Data analysis; Gaussian distribution; Gene expression; Genomics; Laboratories; Performance evaluation; Proteins; Protocols; Systematics;
fLanguage
English
Publisher
ieee
Conference_Titel
Genomic Signal Processing and Statistics, 2009. GENSIPS 2009. IEEE International Workshop on
Conference_Location
Minneapolis, MN
Print_ISBN
978-1-4244-4761-9
Electronic_ISBN
978-1-4244-4762-6
Type
conf
DOI
10.1109/GENSIPS.2009.5174357
Filename
5174357
Link To Document