DocumentCode :
16772
Title :
Weighted Tanimoto Extreme Learning Machine with Case Study in Drug Discovery
Author :
Czarnecki, Wojciech Marian
Author_Institution :
Fac. of Math. & Comput. Sci., Jagiellonian Univ., Krakow, Poland
Volume :
10
Issue :
3
fYear :
2015
fDate :
Aug. 2015
Firstpage :
19
Lastpage :
29
Abstract :
Machine learning methods are becoming more and more popular in the field of computer-aided drug design. The specific data characteristic, including sparse, binary representation as well as noisy, imbalanced datasets, presents a challenging binary classification problem. Currently, two of the most successful models in such tasks are the Support Vector Machine (SVM) and Random Forest (RF). In this paper, we introduce a Weighted Tanimoto Extreme Learning Machine (T-WELM), an extremely simple and fast method for predicting chemical compound biological activity and possibly other data with discrete, binary representation. We show some theoretical properties of the proposed model including the ability to learn arbitrary sets of examples. Further analysis shows numerous advantages of T-WELM over SVMs, RFs and traditional Extreme Learning Machines (ELM) in this particular task. Experiments performed on 40 large datasets of thousands of chemical compounds show that T-WELMs achieve much better classification results and are at the same time faster in terms of both training time and further classification than both ELM models and other state-of-the-art methods in the field.
Keywords :
drug delivery systems; learning (artificial intelligence); medical computing; pattern classification; support vector machines; RF; SVM; T-WELM; binary classification problem; chemical compound biological activity prediction; computer-aided drug design; data characteristic; drug discovery; random forest; support vector machine; weighted Tanimoto extreme learning machine method; Biological system modeling; Compounds; Computational modeling; Design automation; Drugs; Fingerprint recognition; Machine learning;
fLanguage :
English
Journal_Title :
Computational Intelligence Magazine, IEEE
Publisher :
ieee
ISSN :
1556-603X
Type :
jour
DOI :
10.1109/MCI.2015.2437312
Filename :
7160842
Link To Document :
بازگشت