DocumentCode :
1563404
Title :
Preprocessing based solution for the vanishing gradient problem in recurrent neural networks
Author :
Squartini, Stefano ; Hussain, Amir ; Piazza, Francesco
Author_Institution :
Dipt. di Elettronica e Autom., Ancona Univ., Italy
Volume :
5
fYear :
2003
Abstract :
In this paper, a possible solution to the vanishing gradient problem in recurrent neural networks (RNN) is proposed. The main idea consists of pre-processing the signal (a time series typically) through a wavelet decomposition, in order to separate the short term information from the long term one, and treating each scale by different RNNs. The partial results concerning all the different scales of time and frequencies are combined by another ´expert´ (a nonlinear structure typically) in order to achieve the final goal. This new approach is distinct from the other ones reported in the literature to-date, as it tends to simplify the RNN´s learning, working directly at the signal level and avoiding relevant changes in network architecture and learning techniques. The overall system (called the recurrent multiscale network, RMN) is described and its performance tested through typical tasks, namely the latching problem and time series prediction.
Keywords :
discrete wavelet transforms; gradient methods; learning (artificial intelligence); recurrent neural nets; time series; DWT; RMN; RNN learning; RNN signal preprocessing; frequency scales; latching problem; learning techniques; network architecture; partial result combination; recurrent multiscale network; recurrent neural networks; short/long term information separation; time scales; time series prediction; vanishing gradient problem; wavelet decomposition; Computer architecture; Computer networks; Delay effects; Digital signal processing; Electronic mail; Information analysis; Intelligent networks; Neural networks; Recurrent neural networks; Signal processing algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN :
0-7803-7761-3
Type :
conf
DOI :
10.1109/ISCAS.2003.1206412
Filename :
1206412
Link To Document :
بازگشت