DocumentCode
3436762
Title
Neural Networks for Astronomical Data Analysis and Bayesian Inference
Author
Graff, Philip ; Feroz, Farhan ; Hobson, Michael P. ; Lasenby, Anthony
Author_Institution
NASA Goddard Space Flight Center, Gravitational Astrophys. Lab., Greenbelt, MD, USA
fYear
2013
fDate
7-10 Dec. 2013
Firstpage
16
Lastpage
23
Abstract
We present our generic neural network training algorithm, called Sky Net and the accelerated Bayesian inference algorithm, BAMBI. Sky Net combines multiple techniques already developed individually in the literature to create an efficient and robust machine-learning tool that is able to train large and deep feed-forward neural networks for use in a wide range of learning applications, such as regression, classification, density estimation, clustering and dimensionality reduction. Sky Net uses a powerful `pre-training´ method, to obtain a set of network parameters close to the true global maximum of the training objective function, followed by further optimisation using an automatically-regularised variant of Newton´s method that uses second-order derivative information to improve convergence, but without the need to evaluate or store the full Hessian matrix, by using a fast approximate method to calculate Hessian-vector products. This combination of methods allows for the training of complicated networks that are difficult to optimise using standard back propagation techniques. The blind accelerated multimodal Bayesian inference (BAMBI) algorithm implements the MultiNest package for nested sampling as well as the training of an artificial neural network by Sky Net to learn the likelihood function. In the case of computationally expensive likelihoods, this allows the substitution of a much more rapid approximation in order to increase significantly the speed of the analysis. Astrophysical examples are provided for both Sky Net and BAMBI.
Keywords
Bayes methods; Hessian matrices; Newton method; astronomy computing; data analysis; inference mechanisms; neural nets; BAMBI; Hessian matrix; Hessian-vector products; MultiNest package; Newton method; Sky Net; astronomical data analysis; blind accelerated multimodal Bayesian inference algorithm; fast approximate method; feed-forward neural networks; machine-learning tool; neural network training algorithm; second-order derivative information; training objective function; Approximation methods; Artificial neural networks; Bayes methods; Inference algorithms; Training; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location
Dallas, TX
Print_ISBN
978-1-4799-3143-9
Type
conf
DOI
10.1109/ICDMW.2013.82
Filename
6753898
Link To Document