DocumentCode :
423939
Title :
Localized neural network based distributional learning for knowledge discovery in protein databases
Author :
Pokrajac, Dragoljub ; Lazarevic, Aleksandar ; Singleton, Teresa ; Obradovic, Zoran
Author_Institution :
Delaware State Univ., Dover, DE, USA
Volume :
3
fYear :
2004
fDate :
25-29 July 2004
Firstpage :
1663
Abstract :
We investigate the application of localized neural network-based distributional learning techniques for characterizing interesting groups and potentially new types of disorder proteins. Instead of employing a single autoassociator model for learning global distributions of ordered and disordered classes, clustering-based partitioning techniques are first applied independently to both ordered and disordered labeled data set to identify regions of similar characteristics. Subsequently, local autoassociators are employed on labeled data to learn distribution of each cluster. These local autoassociators are used in testing phase to assign each tuple from the unlabeled data set to the cluster closest in distributional sense. Obtained partitions are analyzed for the presence and frequency of the expert-annotated keywords. Frequency comparison is applied to provide insight of keywords sensitive to the distribution heterogeneity and disorder/order labeling. Experimental results on a labeled database of confirmed order and disorder proteins and unlabeled data extracted from SWISS_PROT database are consistent with related literature and can provide further insight into relationship between protein similarity, keyword labeling and the disorder property.
Keywords :
biology computing; data mining; learning (artificial intelligence); neural nets; proteins; SWISS PROT database; biology computing; clustering based partitioning techniques; distributional learning; expert annotated keywords; knowledge discovery; labeled database; localized neural network; protein databases; single autoassociator model; DNA; Data mining; Databases; Frequency; Intelligent networks; Labeling; Neural networks; Predictive models; Proteins; Proteomics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-8359-1
Type :
conf
DOI :
10.1109/IJCNN.2004.1380849
Filename :
1380849
Link To Document :
بازگشت