Title of article :
Structure-based classification of active and inactive estrogenic compounds by decision tree, LVQ and kNN methods
Author/Authors :
Arja Asikainen، نويسنده , , Mikko Kolehmainen، نويسنده , , Juhani Ruuskanen، نويسنده , , Kari Tuppurainen، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2006
Pages :
16
From page :
658
To page :
673
Abstract :
The performance of decision tree (DT), learning vector quantization (LVQ), and k-nearest neighbour (kNN) methods classifying active and inactive estrogenic compounds in terms of their structure activity relationship (SAR) was evaluated. A set of 311 compounds was used for construction of the models, the predictive power of which was verified with separate training and test sets. Principal components derived from molecular descriptors calculated with DRAGON software were used as variables representing the structures of the compounds. Broadly, kNN had the best classification ability and DT the weakest, although the performance of each method was dependent on the group of compounds used for modelling. The best performance was obtained with kNN for the calf estrogen receptor data, averaging 98.3% of correctly classified compounds in the external tests. Overall, the results indicate that all the methods tested are suitable for the SAR classification of estrogenic compounds, producing models with a predictive power ranging from adequate to excellent.
Keywords :
SAR , Principal components , Endocrine disruptors , Sammon s mapping , Tooldiag , estrogen receptor
Journal title :
Chemosphere
Serial Year :
2006
Journal title :
Chemosphere
Record number :
738494
Link To Document :
بازگشت