DocumentCode
2266909
Title
Asymmetric Bagging and Feature Selection for ActivitiesPrediction of Drug Molecules
Author
Guo-Zheng Li ; Hao-Hua Meng ; Yang, M.Q. ; Yang, James Y.
Author_Institution
Shanghai Univ., Shanghai
fYear
2007
fDate
13-15 Aug. 2007
Firstpage
108
Lastpage
114
Abstract
Activities of drug molecules can be predicted by QSAR (quantitative structure activity relationship) models, which overcome the disadvantage of high cost and long cycle by employing the traditional experimental method. With the fact that the number of drug molecules with positive activity is rather fewer than that with negatives, it is important to predict molecular activities considering such an unbalanced situation. Here, asymmetric bagging and feature selection are introduced into the problem and asymmetric bagging of support vector machines (AB-SVM) is proposed on predicting drug activities to treat the unbalanced problem. At the same time, the features extracted from the structures of drug molecules aspects prediction accuracy of QSAR models. Therefore, a novel algorithm named PRIFEAB is proposed, which applies an embedded feature selection method to remove redundant and irrelevant features for AB-SVM. Numerical experimental results on a data set of molecular activities show that AB-SVM improves the A UC values of molecular activities, and PRIFEAB with feature selection further helps to improve the prediction ability.
Keywords
drugs; pharmaceutical technology; support vector machines; PRIFEAB; activities prediction; asymmetric bagging; drug molecules; embedded feature selection; quantitative structure activity relationship models; support vector machines; Accuracy; Bagging; Bioinformatics; Costs; Drugs; Humans; Learning systems; Machine learning; Predictive models; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Computational Sciences, 2007. IMSCCS 2007. Second International Multi-Symposiums on
Conference_Location
Iowa City, IA
Print_ISBN
978-0-7695-3039-0
Type
conf
DOI
10.1109/IMSCCS.2007.89
Filename
4392587
Link To Document