مرکز منطقه ای اطلاع رساني علوم و فناوري - Generalisation Performance of Artificial Neural Networks for Near Infrared Spectral Analysis

Abstract :

Generalisation performance of artificial neural networks (ANNs) is very important when a trained network analyses unseen data. It is associated with factors such as, representativeness and number of training samples, model structure and complexity, training procedures, and appropriate data representation. It is, however, difficult to set common benchmarks to assess the generalisation capabilities of different types of networks. This paper discusses the methods to improve generalisation of multi-layer perceptron (MLP) ANNs and provide experimental proof using near-infrared spectral data. Near-infrared spectra of wheat kernels infested with rice weevil (Sitophilus oryzae) at 11 infestation levels were collected and MLP networks were trained to quantitatively determine the insect infestation levels. Spectral data were pre-processed with principal component analysis (PCA) to reduce the input dimensionality and outliers were successfully detected using Hotelling T2 and Q statistics. Optimal network complexity was selected by evaluating generalisation performance of neural network using the Schwarzʹs Bayesian criteria, Akaikeʹs information criterion, and root mean squared errors of cross-validation (RMSECV) derived by 10-fold cross-validation. Model order assessed by RMSECV provided most economic network complexity. Stacked regression and network committees were shown to overcome the drawbacks of winner-takes-all strategy and gave prediction performance on test set with a lowest root mean squared error of prediction (RMSEP) of 3·5% and coefficient of determination r2 ⩾ 0·9. Prediction performance on average spectra had a lowest RMSEP of 1·2% with higher r2⩾0·97, and prediction performance for low infestation levels (⩽10%) had a lowest RMSEP of 0·4%.