Author/Authors :
Car, Zlatan Faculty of Engineering Rijeka - University of Rijeka - Vukovarska - Rijeka, Croatia , Baressi Šegota, Sandi Faculty of Engineering Rijeka - University of Rijeka - Vukovarska - Rijeka, Croatia , Anđelić, Nikola Faculty of Engineering Rijeka - University of Rijeka - Vukovarska - Rijeka, Croatia , Lorencin, Ivan Faculty of Engineering Rijeka - University of Rijeka - Vukovarska - Rijeka, Croatia , Mrzljak, Vedran Faculty of Engineering Rijeka - University of Rijeka - Vukovarska - Rijeka, Croatia
Abstract :
Coronavirus (COVID-19) is a highly infectious disease that has captured the attention of the worldwide public. Modeling of such
diseases can be extremely important in the prediction of their impact. While classic, statistical, modeling can provide satisfactory
models, it can also fail to comprehend the intricacies contained within the data. In this paper, authors use a publicly available
dataset, containing information on infected, recovered, and deceased patients in 406 locations over 51 days (22nd January 2020
to 12th March 2020). This dataset, intended to be a time-series dataset, is transformed into a regression dataset and used in
training a multilayer perceptron (MLP) artificial neural network (ANN). The aim of training is to achieve a worldwide model of
the maximal number of patients across all locations in each time unit. Hyperparameters of the MLP are varied using a grid
search algorithm, with a total of 5376 hyperparameter combinations. Using those combinations, a total of 48384 ANNs are
trained (16128 for each patient group—deceased, recovered, and infected), and each model is evaluated using the coefficient of
determination (R2). Cross-validation is performed using K-fold algorithm with 5-folds. Best models achieved consists of 4
hidden layers with 4 neurons in each of those layers, and use a ReLU activation function, with R2 scores of 0.98599 for
confirmed, 0.99429 for deceased, and 0.97941 for recovered patient models. When cross-validation is performed, these scores
drop to 0.94 for confirmed, 0.781 for recovered, and 0.986 for deceased patient models, showing high robustness of the deceased
patient model, good robustness for confirmed, and low robustness for recovered patient model.