Title of article :
In silico classification of human maximum recommended daily dose based on modified random forest and substructure fingerprint Original Research Article
Author/Authors :
Dong-Sheng Cao، نويسنده , , Qian-Nan Hu، نويسنده , , Qing-Song Xu، نويسنده , , Yan-Ning Yang، نويسنده , , Jian-Chao Zhao، نويسنده , , Hong-Mei Lu، نويسنده , , Liang-Xiao Zhang، نويسنده , , Yi-Zeng Liang، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Pages :
7
From page :
50
To page :
56
Abstract :
A modified random forest (RF) algorithm, as a novel machine learning technique, was developed to estimate the maximum recommended daily dose (MRDD) of a large and diverse pharmaceutical dataset for phase I human trials using substructure fingerprint descriptors calculated from simple molecular structure alone. This type of novel molecular descriptors encodes molecular structure in a series of binary bits that represent the presence or absence of particular substructures in the molecule and thereby can accurately and directly depict a series of local information hidden in this molecule. Two model validation approaches, 5-fold cross-validation and an independent validation set, were used for assessing the prediction capability of our models. The results obtained in this study indicate that the modified RF gave prediction accuracy of 80.45%, sensitivity of 75.08%, specificity of 84.85% for 5-fold cross-validation, and prediction accuracy of 80.5%, sensitivity of 76.47%, specificity of 83.48% for independent validation set, respectively, which are as a whole better than those by the original RF. At the same time, the important substructure fingerprints, recognized by the RF technique, gave some insights into the structure features related to toxicity of pharmaceuticals. This could help provide intuitive understanding for medicinal chemists.
Keywords :
Drug toxicity , Substructure fingerprint , Maximum recommended daily dose (MRDD) , Machine learning , Modified random forest
Journal title :
Analytica Chimica Acta
Serial Year :
2011
Journal title :
Analytica Chimica Acta
Record number :
1026405
Link To Document :
بازگشت