DocumentCode :
1696960
Title :
Ensembles of classifiers and their aplication to bioinformatics problems
Author :
Palade, Vasile
Author_Institution :
Comput. Lab., Oxford Univ., Oxford, UK
fYear :
2010
Firstpage :
1
Lastpage :
1
Abstract :
Summary form only given. Machine Learning has become a very popular approach in addressing problems in the Computational Biology and Bioinformatics area. In addition, multi-classifier systems have also gained popularity among researchers working in machine learning and applications for their ability to fuse together multiple models and obtain better overall accuracy and classification results. This talk is concerned with current issues in the design of multi-classifier systems and presents some multi-classifier developments for several bioinformatics problems. The talk will first present an overview and current status of machine learning methods in bioinformatics and computational biology. The talk will then bring in some important issues in building ensembles of classifiers, with a focus on the diversity and combination of individual classifiers. Few diversification and combination schemes are presented along with guidelines for the selection of different training paradigms and performance metrics, based on the properties and distribution of the data. Then, the presentation will proceed with introducing our computational intelligence based multi-classifier developments for solving several bioinformatics problems, such as recognizing sequences in DNA strings, micro-array gene expression data analysis, protein structure prediction. The talk will also present related machine learning issues in developing such systems, such as learning from imbalanced datasets and using appropriate performance metrics for model selection and evaluation. The presented approaches and results will advocate that ensembles of classifiers can be used as effective modelling tools in solving challenging bioinformatics problems.
Keywords :
DNA; bioinformatics; learning (artificial intelligence); pattern classification; proteins; software metrics; software performance evaluation; DNA string sequences; bioinformatics problems; classifier ensembles; computational biology; computational intelligence; imbalanced datasets; machine learning; microarray gene expression data analysis; multiclassifier systems; performance metrics; protein structure prediction; training paradigms; Bioinformatics; Biological system modeling; Buildings; Computational biology; Computational modeling; Educational institutions; Machine learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems (CBMS), 2010 IEEE 23rd International Symposium on
Conference_Location :
Perth, WA
ISSN :
1063-7125
Print_ISBN :
978-1-4244-9167-4
Type :
conf
DOI :
10.1109/CBMS.2010.6042609
Filename :
6042609
Link To Document :
بازگشت