Title :
A block-approximate local Hessian-matrix analysis for CANFIS neuro-fuzzy modular network learning
Author_Institution :
Dept. of Ind. Manage., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
Abstract :
We describe how to alleviate the problem of plateau phenomena that may arise in learning with a CANFIS neuro-fuzzy modular network. The network model consists of multiple “local-expert” MLPs (multilayer perceptrons) mediated by fuzzy membership functions. Even with such a complex modular architecture, our recently-developed second-order stagewise backpropagation procedure efficiently evaluates the Hessian matrix of a given objective function, for which we employ the sum-squared-error measure. For concreteness, we use a small curve-fitting problem that allows us to demonstrate detailed analysis based on the Hessian matrix. In particular, we describe how to use a block-diagonal approximate local Hessian matrix associated with a bottleneck local-expert MLP that does not perform very well for a designated task. Since the “bad” performance implies relatively large residuals (or errors), the local Hessian matrix tends to be indefinite; therefore, it is worth exploiting the negative curvature to escape from the plateaus.
Keywords :
Hessian matrices; approximation theory; backpropagation; curve fitting; fuzzy neural nets; multilayer perceptrons; CANFIS neuro-fuzzy modular network learning; block-diagonal approximate local Hessian matrix; complex modular architecture; curve-fitting problem; fuzzy membership function; local-expert MLP; multilayer perceptron; objective function; plateau phenomena; second-order stagewise backpropagation procedure; sum-squared-error measure; Adaptation models; Backpropagation; Couplings; Fuzzy neural networks; Jacobian matrices; Multilayer perceptrons; Vectors; CANFIS neuro-fuzzy modular networks; negative curvature; plateau phenomena;
Conference_Titel :
Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1507-4
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZ-IEEE.2012.6251277