• DocumentCode
    749944
  • Title

    An improved algorithm for learning long-term dependency problems in adaptive processing of data structures

  • Author

    Cho, Siu-Yeung ; Chi, Zheru ; Siu, Wan-chi ; Tsoi, Ah Chung

  • Author_Institution
    Dept. of Electron. & Inf. Eng., Hong Kong Polytech. Univ., Kowloon, China
  • Volume
    14
  • Issue
    4
  • fYear
    2003
  • fDate
    7/1/2003 12:00:00 AM
  • Firstpage
    781
  • Lastpage
    793
  • Abstract
    Many researchers have explored the use of neural-network representations for the adaptive processing of data structures. One of the most popular learning formulations of data structure processing is backpropagation through structure (BPTS). The BPTS algorithm has been successful applied to a number of learning tasks that involve structural patterns such as logo and natural scene classification. The main limitations of the BPTS algorithm are attributed to slow convergence speed and the long-term dependency problem for the adaptive processing of data structures. In this paper, an improved algorithm is proposed to solve these problems. The idea of this algorithm is to optimize the free learning parameters of the neural network in the node representation by using least-squares-based optimization methods in a layer-by-layer fashion. Not only can fast convergence speed be achieved, but the long-term dependency problem can also be overcome since the vanishing of gradient information is avoided when our approach is applied to very deep tree structures.
  • Keywords
    backpropagation; convergence; data structures; least squares approximations; neural nets; optimisation; adaptive processing; backpropagation through structure; data structures; free learning parameters; learning tasks; least-squares-based optimization methods; long-term dependency problems; neural-network representations; node representation; slow convergence speed; structural patterns; Artificial neural networks; Backpropagation algorithms; Convergence; Data structures; Layout; Neural networks; Neurons; Optimization methods; Signal processing algorithms; Tree data structures;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2003.813831
  • Filename
    1215396