Title :
Overfitting by PSO trained feedforward neural networks
Author :
Van Wyk, Andrich B. ; Engelbrecht, Andries P.
Author_Institution :
Dept. of Comput. Sci., Univ. of Pretoria, Pretoria, South Africa
Abstract :
The purpose of this paper is to investigate the overfitting behavior of particle swarm optimization (PSO) trained neural networks. Neural networks trained with PSOs using the global best, local best and Von Neumann information sharing topologies are investigated. Experiments are conducted on five classification and five time series regression problems. It is shown that differences exist in the degree of overfitting between the different topologies. Additionally, non-convergence of the swarms is witnessed, which is hypothetically attributed to the use of a bounded activation function in the neural networks. The hypothesis is supported by experiments conducted using an unbounded activation function in the neural network hidden layer, which lead to convergent swarms. Additionally this also lead to drastically reduced overfitting by the neural networks.
Keywords :
feedforward neural nets; particle swarm optimisation; regression analysis; time series; PSO trained feedforward neural networks; Von Neumann information sharing topologies; bounded activation function; neural network hidden layer; overfitting behavior; particle swarm optimization; time series regression problems; Artificial neural networks; Equations; Network topology; Neurons; Particle measurements; Topology; Training;
Conference_Titel :
Evolutionary Computation (CEC), 2010 IEEE Congress on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6909-3
DOI :
10.1109/CEC.2010.5586333