DocumentCode :
1311228
Title :
Convergence Analyses on On-Line Weight Noise Injection-Based Training Algorithms for MLPs
Author :
Sum, John ; Chi-Sing Leung ; Ho, Kayla
Author_Institution :
Inst. of Technol. Manage., Nat. Chung Hsing Univ., Taichung, Taiwan
Volume :
23
Issue :
11
fYear :
2012
Firstpage :
1827
Lastpage :
1840
Abstract :
Injecting weight noise during training is a simple technique that has been proposed for almost two decades. However, little is known about its convergence behavior. This paper studies the convergence of two weight noise injection-based training algorithms, multiplicative weight noise injection with weight decay and additive weight noise injection with weight decay. We consider that they are applied to multilayer perceptrons either with linear or sigmoid output nodes. Let w(t) be the weight vector, let V(w) be the corresponding objective function of the training algorithm, let α >; 0 be the weight decay constant, and let μ(t) be the step size. We show that if μ(t)→ 0, then with probability one E[||w(t)||22] is bound and limt→∞||w(t)||2 exists. Based on these two properties, we show that if μ(t)→ 0, Σtμ(t)=∞, and Σtμ(t)2 <; ∞, then with probability one these algorithms converge. Moreover, w(t) converges with probability one to a point where ∇wV(w)=0.
Keywords :
convergence; multilayer perceptrons; probability; MLP; additive weight noise injection; convergence analysis; linear output nodes; multilayer perceptrons; multiplicative weight noise injection; online weight noise injection-based training algorithms; probability; sigmoid output nodes; weight decay; Additives; Convergence; Linear programming; Noise; Prediction algorithms; Training; Vectors; Additive noise; convergence; multilayer perceptron; multiplicative noise; weight noise injection;
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2012.2210243
Filename :
6324446
Link To Document :
بازگشت