• DocumentCode
    2737117
  • Title

    Poster: Issues in functional characterization and clustering of genes by literature mining

  • Author

    Dasigi, V. ; Karam, Orlando ; Pydimarri, Sailaja

  • fYear
    2011
  • fDate
    3-5 Feb. 2011
  • Firstpage
    239
  • Lastpage
    239
  • Abstract
    This paper studies the issues involved in characterizing the function of the genes that are involved in the life cycle of budding yeast, and in clustering them based on their potential functional similarities. Clustering results on these genes have been reported so we have a basis for comparison. The task of clustering genes is done in two steps: First, keywords corresponding to all genes of interest from a subset of MEDLINE database were extracted automatically using TF-IDF and Z-scores. In the second step, the classic K-means algorithm was used to group genes into clusters of genes based on the keyword features.
  • Keywords
    biology computing; cellular biophysics; data mining; genetics; microorganisms; MEDLINE database; TF-IDF; Z-scores; budding yeast; classic K-means algorithm; functional characterization; gene clustering; keyword extraction; life cycle; literature mining; Abstracts; Clustering algorithms; Context; Databases; Feature extraction; Libraries; Venus;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    978-1-61284-851-8
  • Type

    conf

  • DOI
    10.1109/ICCABS.2011.5729893
  • Filename
    5729893