DocumentCode
1476481
Title
Extended Hamiltonian Learning on Riemannian Manifolds: Theoretical Aspects
Author
Fiori, Simone
Author_Institution
Dipt. di Ing. Biomedica, Elettronicae Telecomun., Univ. Politec. delle Marche, Ancona, Italy
Volume
22
Issue
5
fYear
2011
fDate
5/1/2011 12:00:00 AM
Firstpage
687
Lastpage
700
Abstract
This paper introduces a general theory of extended Hamiltonian (second-order) learning on Riemannian manifolds, as an instance of learning by constrained criterion optimization. The dynamical learning equations are derived within the general framework of extended-Hamiltonian stationary-action principle and are expressed in a coordinate-free fashion. A theoretical analysis is carried out in order to compare the features of the dynamical learning theory with the features exhibited by the gradient-based ones. In particular, gradient-based learning is shown to be an instance of dynamical learning, and the classical gradient-based learning modified by a “momentum” term is shown to resemble discrete-time dynamical learning. Moreover, the convergence features of gradient-based and dynamical learning are compared on a theoretical basis. This paper discusses cases of learning by dynamical systems on manifolds of interest in the scientific literature, namely, the Stiefel manifold, the special orthogonal group, the Grassmann manifold, the group of symmetric positive definite matrices, the generalized flag manifold, and the real symplectic group of matrices.
Keywords
gradient methods; learning (artificial intelligence); Grassmann manifold; Riemannian manifold; Stiefel manifold; constrained criterion optimization; discrete-time dynamical learning; dynamical learning equation; extended Hamiltonian learning; extended-Hamiltonian stationary-action principle; gradient-based learning; second-order learning; Equations; Manifolds; Mathematical model; Measurement; Optimization; Tensile stress; Trajectory; Extended Hamiltonian (second-order) learning; Riemannian manifold; gradient-based (first-order) learning; learning by constrained criterion optimization; Algorithms; Artificial Intelligence; Computer Simulation; Mathematical Concepts; Models, Theoretical; Neural Networks (Computer);
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
ieee
ISSN
1045-9227
Type
jour
DOI
10.1109/TNN.2011.2109395
Filename
5735228
Link To Document