• DocumentCode
    3189121
  • Title

    Discovering Gene Expression Data from the Tables of Full Text Publications

  • Author

    Mathiak, Brigitte ; Kupfer, Andreas ; Bartulos, Carolina Rio ; Scope, Tatjana ; Weiland, Johann ; Eckstein, Silke

  • Author_Institution
    Inst. fur Informationssysteme, Braunschweig
  • fYear
    2007
  • fDate
    28-31 Oct. 2007
  • Firstpage
    113
  • Lastpage
    118
  • Abstract
    Finding out which genes are expressed in which circumstances is one of the most common tasks in text mining for bioinformatics. But usually the derived data comes from the abstract or other describing texts in the literature. In the age of modern high-throughput microarray analysis, however, there is too much data to be described textual; instead this data often comes in form of tables. In this paper, we are looking specifically at the tables, an approach to our knowledge never described before. The goal is to attach gene names found in tables to their context for a convenient literature review. In order to do so, matching literature has to be downloaded and pre-processed. After that has been done, gene names or protein names can be found through a fast and reliable search, presenting all the associated literature at a glance.
  • Keywords
    biology computing; data mining; genetics; proteins; text analysis; bioinformatics; full text publications; gene expression data discovery; gene names; microarray analysis; protein names; tabular data; text mining; textual data; Bioinformatics; Biomedical optical imaging; Data mining; Diabetes; Gene expression; Natural languages; Optical character recognition software; Particle separators; Proteins; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
  • Conference_Location
    Omaha, NE
  • Print_ISBN
    978-0-7695-3019-2
  • Electronic_ISBN
    978-0-7695-3033-8
  • Type

    conf

  • DOI
    10.1109/ICDMW.2007.29
  • Filename
    4476655