DocumentCode :
2277901
Title :
Lambda-gamma learning with feedforward neural networks using particle swarm optimization
Author :
Van Wyk, Andrich B. ; Engelbrecht, Andries P.
Author_Institution :
Dept. of Comput. Sci., Univ. of Pretoria, Pretoria, South Africa
fYear :
2011
fDate :
11-15 April 2011
Firstpage :
1
Lastpage :
8
Abstract :
The sigmoid function is a widely used, bounded activation function for feedforward neural networks (FFNNs). A problem with using bounded activation functions is that it necessitates scaling of the data to suit the fixed domain and range of the function. Alternatively the activation function itself can be adapted by learning the gradient and range of the function alongside the FFNN weights. The purpose of this paper is to investigate whether the particle swarm optimization (PSO) algorithm is capable of training FFNNs that use adaptive sigmoid activation functions. The PSO algorithm is also compared against the gradient based lambda-gamma backpropagation learning algorithm (LG-BP) on five classification and five regression data sets. Experiments are conducted with scaled and unscaled input data as well as target output ranges of increasing size. The PSO algorithm proves capable of training adaptive activation function FFNNs and significantly outperforms the LG-BP algorithm on all problems. With the PSO, the use of adaptive activation functions improves the training accuracy of the FFNN, but leads to worse generalization performance due to overfitting. Increasing the size of the target output range increases the overfitting and worsens the generalization performance. Less overfitting is witnessed on data sets with unscaled input data.
Keywords :
backpropagation; feedforward neural nets; particle swarm optimisation; pattern classification; regression analysis; transfer functions; PSO algorithm; adaptive sigmoid activation function; data classification; data regression; feedforward neural network; gradient based lambda-gamma backpropagation learning algorithm; particle swarm optimization; Accuracy; Artificial neural networks; Diabetes; Equations; Neurons; Optimization; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Swarm Intelligence (SIS), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-61284-053-6
Type :
conf
DOI :
10.1109/SIS.2011.5952561
Filename :
5952561
Link To Document :
بازگشت