• DocumentCode
    2682493
  • Title

    The effects of pre-processing and parameter choices on searches through large gene expression data collections

  • Author

    Hibbs, Matthew A.

  • Author_Institution
    Jackson Lab., Bar Harbor, ME, USA
  • fYear
    2009
  • fDate
    17-21 May 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Gene expression microarray data collections contain information that can shed light on a variety of systems-level biological problems, including the functional roles of proteins and the regulatory networks governing their transcription and translation. However, the analysis of these data is complicated by unusual noise characteristics and variation between experimental protocols and technologies. Many of the efforts to confront these difficulties utilize additional pre-processing strategies to adjust the input data and/or alter parameter choices of their algorithmic approach. Here, we examine the effect of some of these techniques in the context of the SPELL similarity search algorithm. Our results demonstrate that pre-processing and parameter choices can greatly affect the performance of this approach. As such, these choices should be carefully considered and evaluated when performing a broad range of analyses of gene expression data.
  • Keywords
    bioinformatics; data analysis; genetics; proteins; search problems; SPELL similarity search algorithm; gene expression data collection; gene expression microarray data; noise characteristics; protein function; regulatory networks; systems-level biological problems; Bioinformatics; Data analysis; Gaussian distribution; Gene expression; Genomics; Laboratories; Performance evaluation; Proteins; Protocols; Systematics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics, 2009. GENSIPS 2009. IEEE International Workshop on
  • Conference_Location
    Minneapolis, MN
  • Print_ISBN
    978-1-4244-4761-9
  • Electronic_ISBN
    978-1-4244-4762-6
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2009.5174357
  • Filename
    5174357