DocumentCode :
1346708
Title :
Data strip mining for the virtual design of pharmaceuticals with neural networks
Author :
Kewley, Robert H. ; Embrechts, Mark J. ; Breneman, Curt
Author_Institution :
Dept. of Syst. Eng., US Mil. Acad., West Point, NY, USA
Volume :
11
Issue :
3
fYear :
2000
fDate :
5/1/2000 12:00:00 AM
Firstpage :
668
Lastpage :
679
Abstract :
A novel neural network based technique, called “data strip mining” extracts predictive models from data sets which have a large number of potential inputs and comparatively few data points. This methodology uses neural network sensitivity analysis to determine which predictors are most significant in the problem. Neural network sensitivity analysis holds all but one input to a trained neural network constant while varying each input over its entire range to determine its effect on the output. Elimination of variables through neural network sensitivity analysis and predicting performance through model cross-validation allows the analyst to reduce the number of inputs and improve the model´s predictive ability at the same time. This paper demonstrates its effectiveness on a pair of problems from combinatorial chemistry with over 400 potential inputs each. For these data sets, model selection by neural sensitivity analysis outperformed other variable selection methods including the forward selection and genetic algorithm
Keywords :
chemistry computing; data mining; neural nets; pharmaceutical industry; sensitivity analysis; combinatorial chemistry; data strip mining; model cross-validation; neural network; pharmaceuticals; predictive models; sensitivity analysis; Chemistry; Data mining; Genetic algorithms; Input variables; Neural networks; Performance analysis; Pharmaceuticals; Predictive models; Sensitivity analysis; Strips;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/72.846738
Filename :
846738
Link To Document :
بازگشت