• DocumentCode
    423664
  • Title

    Nonextensive entropy and regularization for adaptive learning

  • Author

    Anastasiadis, Aristoklis D. ; Magoulas, George D.

  • Author_Institution
    Sch. of Comput. Sci. & Inf. Syst., London Univ., UK
  • Volume
    2
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    1067
  • Abstract
    A challenging situation gradient-based learning algorithms encounter is an occasional converge to undesired local minima. To alleviate this situation, this paper builds on the theory of nonextensive statistical mechanics to develop a new adaptive gradient-based learning scheme that applies a sign-based weight adjustment, inspired from the Rprop algorithm, on a perturbed version of the original error function. The perturbations are characterized by the q entropic index of the nonextensive entropy, and their impact is controlled by means of regularization. This approach modifies the error landscape at each iteration allowing the algorithm to explore previously unavailable regions of the error surface, and possibly escape undesired local minima. The performance of the adaptive scheme is empirically evaluated using problems from the UCI repository of machine learning databases and other classic benchmarks.
  • Keywords
    convergence; entropy; gradient methods; learning (artificial intelligence); neural nets; simulated annealing; Rprop algorithm; adaptive gradient based learning algorithms; convergence; error landscape modification; iteration method; local minima; machine learning databases; neural nets; nonextensive entropy; nonextensive regularization; nonextensive statistical mechanics; q entropic index; sign based weight adjustment; simulated annealing; Artificial neural networks; Computer science; Databases; Educational institutions; Entropy; Information systems; Machine learning; Machine learning algorithms; Neural networks; Simulated annealing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1380082
  • Filename
    1380082