Title : 
Nonextensive entropy and regularization for adaptive learning
         
        
            Author : 
Anastasiadis, Aristoklis D. ; Magoulas, George D.
         
        
            Author_Institution : 
Sch. of Comput. Sci. & Inf. Syst., London Univ., UK
         
        
        
        
        
        
            Abstract : 
A challenging situation gradient-based learning algorithms encounter is an occasional converge to undesired local minima. To alleviate this situation, this paper builds on the theory of nonextensive statistical mechanics to develop a new adaptive gradient-based learning scheme that applies a sign-based weight adjustment, inspired from the Rprop algorithm, on a perturbed version of the original error function. The perturbations are characterized by the q entropic index of the nonextensive entropy, and their impact is controlled by means of regularization. This approach modifies the error landscape at each iteration allowing the algorithm to explore previously unavailable regions of the error surface, and possibly escape undesired local minima. The performance of the adaptive scheme is empirically evaluated using problems from the UCI repository of machine learning databases and other classic benchmarks.
         
        
            Keywords : 
convergence; entropy; gradient methods; learning (artificial intelligence); neural nets; simulated annealing; Rprop algorithm; adaptive gradient based learning algorithms; convergence; error landscape modification; iteration method; local minima; machine learning databases; neural nets; nonextensive entropy; nonextensive regularization; nonextensive statistical mechanics; q entropic index; sign based weight adjustment; simulated annealing; Artificial neural networks; Computer science; Databases; Educational institutions; Entropy; Information systems; Machine learning; Machine learning algorithms; Neural networks; Simulated annealing;
         
        
        
        
            Conference_Titel : 
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
         
        
        
            Print_ISBN : 
0-7803-8359-1
         
        
        
            DOI : 
10.1109/IJCNN.2004.1380082