Title :
Gene family identification network design
Author :
Wu, Cathy H. ; Shivakumar, Sailaja
Author_Institution :
Dept. of Epidemiology/Biomath., Texas Univ. Health Center, Tyler, TX, USA
Abstract :
The exponential accumulation of molecular data will facilitate the discovery of new knowledge by using information embedded within families of homologous sequences. As an approach to the management and analysis of sequence data, we have developed an integrated system, termed GeneFIND (Gene Family Identification Network Design), for database searching against gene families. It provides rapid and accurate protein family identification by combining global and motif sequence similarities and incorporating ProClass family information. Multilevel filters are used, starting with the MOTIFIND neural network and BLAST search, followed by SSEARCH alignment motif pattern match, hidden Markov modeling of motifs and ClustalW motif alignment. GeneFIND has been implemented as a full-scale system for the classification of more than 1000 ProSite and 3000 PIR families. It is used to identify thousands of new family members and is well suited for genomic sequence analysis
Keywords :
biology computing; database management systems; filtering theory; genetics; hidden Markov models; information retrieval; neural nets; pattern recognition; BLAST search; ClustalW motif alignment; GeneFIND; HMM; MOTIFIND neural network; PIR families; ProClass family information; ProSite families; SSEARCH alignment motif pattern match; database searching; gene family identification network design; genomic sequence analysis; global sequence similarities; hidden Markov modeling; homologous sequences; molecular data; motif sequence similarities; multilevel filters; protein family identification; sequence data analysis; sequence data management; Bioinformatics; Data analysis; Databases; Genomics; Hidden Markov models; Matched filters; Neural networks; Pattern matching; Proteins;
Conference_Titel :
Intelligence and Systems, 1998. Proceedings., IEEE International Joint Symposia on
Conference_Location :
Rockville, MD
Print_ISBN :
0-8186-8548-4
DOI :
10.1109/IJSIS.1998.685426