Title of article :
The problem of bias in training data in regression problems in medical decision support
Author/Authors :
Mac Namee، نويسنده , , B. and Cunningham، نويسنده , , P. and Byrne، نويسنده , , S. and Corrigan، نويسنده , , O.I.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2002
Pages :
20
From page :
51
To page :
70
Abstract :
This paper describes a bias problem encountered in a machine learning approach to outcome prediction in anticoagulant drug therapy. The outcome to be predicted is a measure of the clotting time for the patient; this measure is continuous and so the prediction task is a regression problem. Artificial neural networks (ANNs) are a powerful mechanism for learning to predict such outcomes from training data. However, experiments have shown that an ANN is biased towards values more commonly occurring in the training data and is thus, less likely to be correct in predicting extreme values. This issue of bias in training data in regression problems is similar to the associated problem with minority classes in classification. However, this bias issue in classification is well documented and is an on-going area of research. In this paper, we consider stratified sampling and boosting as solutions to this bias problem and evaluate them on this outcome prediction problem and on two other datasets. Both approaches produce some improvements with boosting showing the most promise.
Keywords :
Artificial neural networks , Medical decision support , Regression , Anticoagulant drug therapy
Journal title :
Artificial Intelligence In Medicine
Serial Year :
2002
Journal title :
Artificial Intelligence In Medicine
Record number :
1835853
Link To Document :
بازگشت