Title of article
A comparison of classification models to identify the Fragile X Syndrome
Author/Authors
David Bock، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2008
Pages
12
From page
233
To page
244
Abstract
The main models of machine learning are briefly reviewed and considered for building a classifier to identify
the Fragile X Syndrome (FXS).We have analyzed 172 patients potentially affected by FXS in Andalusia
(Spain) and, by means of a DNA test, each member of the data set is known to belong to one of two classes:
affected, not affected. The whole predictor set, formed by 40 variables, and a reduced set with only nine
predictors significantly associated with the response are considered. Four alternative base classification
models have been investigated: logistic regression, classification trees, multilayer perceptron and support
vector machines. For both predictor sets, the best accuracy, considering both the mean and the standard
deviation of the test error rate, is achieved by the support vector machines, confirming the increasing
importance of this learning algorithm. Three ensemble methods – bagging, random forests and boosting –
were also considered, amongst which the bagged versions of support vector machines stand out, especially
when they are constructed with the reduced set of predictor variables. The analysis of the sensitivity, the
specificity and the area under the ROC curve agrees with the main conclusions extracted from the accuracy
results. All of these models can be fitted by free R programs.
Keywords
Fragile X Syndrome , Support vector machines , Multilayer perceptron , Classification trees , logistic regression , Ensemble methods , R system
Journal title
JOURNAL OF APPLIED STATISTICS
Serial Year
2008
Journal title
JOURNAL OF APPLIED STATISTICS
Record number
712192
Link To Document