• DocumentCode
    2709803
  • Title

    Improving gradient-based learning algorithms for large scale feedforward networks

  • Author

    Ventresca, M. ; Tizhoosh, H.R.

  • Author_Institution
    Syst. Design Eng. Dept., Univ. of Waterloo, Waterloo, ON, Canada
  • fYear
    2009
  • fDate
    14-19 June 2009
  • Firstpage
    3212
  • Lastpage
    3219
  • Abstract
    Large scale neural networks have many hundreds or thousands of parameters (weights and biases) to learn, and as a result tend to have very long training times. Small scale networks can be trained quickly by using second-order information, but these fail for large architectures due to high computational cost. Other approaches employ local search strategies, which also add to the computational cost. In this paper we present a simple method, based on opposite transfer functions which greatly improve the convergence rate and accuracy of gradient-based learning algorithms. We use two variants of the backpropagation algorithm and common benchmark data to highlight the improvements. We find statistically significant improvements in both convergence speed and accuracy.
  • Keywords
    backpropagation; feedforward neural nets; gradient methods; search problems; statistical analysis; transfer functions; backpropagation algorithm; computational cost; convergence rate; gradient-based learning algorithm; large scale feedforward neural network; local search strategy; opposite transfer function; second-order information; statistical analysis; Backpropagation algorithms; Computational efficiency; Computer architecture; Computer networks; Convergence; Design engineering; Feedforward neural networks; Large-scale systems; Neural networks; Transfer functions; Large scale networks; backpropgation; gradient-based learning; opposite transfer functions; opposition-based computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2009. IJCNN 2009. International Joint Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-3548-7
  • Electronic_ISBN
    1098-7576
  • Type

    conf

  • DOI
    10.1109/IJCNN.2009.5178798
  • Filename
    5178798