DocumentCode :
47768
Title :
Bayesian Nonparametric Models for Multiway Data Analysis
Author :
Xu, Zongben ; Yan, Fengping ; Qi, Yaoyao
Author_Institution :
School of Computer Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
Volume :
37
Issue :
2
fYear :
2015
fDate :
Feb. 2015
Firstpage :
475
Lastpage :
487
Abstract :
Tensor decomposition is a powerful computational tool for multiway data analysis. Many popular tensor decomposition approaches—such as the Tucker decomposition and CANDECOMP/PARAFAC (CP)—amount to multi-linear factorization. They are insufficient to model (i) complex interactions between data entities, (ii) various data types (e.g., missing data and binary data), and (iii) noisy observations and outliers. To address these issues, we propose tensor-variate latent nonparametric Bayesian models for multiway data analysis. We name these models InfTucker. These new models essentially conduct Tucker decomposition in an infinite feature space. Unlike classical tensor decomposition models, our new approaches handle both continuous and binary data in a probabilistic framework. Unlike previous Bayesian models on matrices and tensors, our models are based on latent Gaussian or t processes with nonlinear covariance functions. Moreover, on network data, our models reduce to nonparametric stochastic blockmodels and can be used to discover latent groups and predict missing interactions. To learn the models efficiently from data, we develop a variational inference technique and explore properties of the Kronecker product for computational efficiency. Compared with a classical variational implementation, this technique reduces both time and space complexities by several orders of magnitude. On real multiway and network data, our new models achieved significantly higher prediction accuracy than state-of-art tensor decomposition methods and blockmodels.
Keywords :
Bayes methods; Computational modeling; Data models; Gaussian processes; Matrix decomposition; Noise; Tensile stress; Algorithms for data and knowledge management; Gaussian process; Machine learning; Multiway analysis; network modeling; nonparametric Bayes; random graphs and exchangeable arrays; stochastic blockmodel; tensor/matrix factorization;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2013.201
Filename :
6629993
Link To Document :
بازگشت