• DocumentCode
    2638969
  • Title

    Gene family identification network design

  • Author

    Wu, Cathy H. ; Shivakumar, Sailaja

  • Author_Institution
    Dept. of Epidemiology/Biomath., Texas Univ. Health Center, Tyler, TX, USA
  • fYear
    1998
  • fDate
    21-23 May 1998
  • Firstpage
    103
  • Lastpage
    110
  • Abstract
    The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis
  • Keywords
    biology computing; database management systems; filtering theory; genetics; hidden Markov models; information retrieval; neural nets; pattern recognition; BLAST search; ClustalW motif alignment; GeneFIND; HMM; MOTIFIND neural network; PIR families; ProClass family information; ProSite families; SSEARCH alignment motif pattern match; database searching; gene family identification network design; genomic sequence analysis; global sequence similarities; hidden Markov modeling; homologous sequences; molecular data; motif sequence similarities; multilevel filters; protein family identification; sequence data analysis; sequence data management; Bioinformatics; Data analysis; Databases; Genomics; Hidden Markov models; Matched filters; Neural networks; Pattern matching; Proteins;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligence and Systems, 1998. Proceedings., IEEE International Joint Symposia on
  • Conference_Location
    Rockville, MD
  • Print_ISBN
    0-8186-8548-4
  • Type

    conf

  • DOI
    10.1109/IJSIS.1998.685426
  • Filename
    685426