Title :
Lambda-gamma learning with feedforward neural networks using particle swarm optimization
Author :
Van Wyk, Andrich B. ; Engelbrecht, Andries P.
Author_Institution :
Dept. of Comput. Sci., Univ. of Pretoria, Pretoria, South Africa
Abstract :
The sigmoid function is a widely used, bounded activation function for feedforward neural networks (FFNNs). A problem with using bounded activation functions is that it necessitates scaling of the data to suit the fixed domain and range of the function. Alternatively the activation function itself can be adapted by learning the gradient and range of the function alongside the FFNN weights. The purpose of this paper is to investigate whether the particle swarm optimization (PSO) algorithm is capable of training FFNNs that use adaptive sigmoid activation functions. The PSO algorithm is also compared against the gradient based lambda-gamma backpropagation learning algorithm (LG-BP) on five classification and five regression data sets. Experiments are conducted with scaled and unscaled input data as well as target output ranges of increasing size. The PSO algorithm proves capable of training adaptive activation function FFNNs and significantly outperforms the LG-BP algorithm on all problems. With the PSO, the use of adaptive activation functions improves the training accuracy of the FFNN, but leads to worse generalization performance due to overfitting. Increasing the size of the target output range increases the overfitting and worsens the generalization performance. Less overfitting is witnessed on data sets with unscaled input data.
Keywords :
backpropagation; feedforward neural nets; particle swarm optimisation; pattern classification; regression analysis; transfer functions; PSO algorithm; adaptive sigmoid activation function; data classification; data regression; feedforward neural network; gradient based lambda-gamma backpropagation learning algorithm; particle swarm optimization; Accuracy; Artificial neural networks; Diabetes; Equations; Neurons; Optimization; Training;
Conference_Titel :
Swarm Intelligence (SIS), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-61284-053-6
DOI :
10.1109/SIS.2011.5952561