Title :
Exponential Family Tensor Factorization for Missing-Values Prediction and Anomaly Detection
Author :
Hayashi, Kohei ; Takenouchi, Takashi ; Shibata, Tomohiro ; Kamiya, Yuki ; Kato, Daishi ; Kunieda, Kazuo ; Yamada, Keiji ; Ikeda, Kazushi
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
Abstract :
In this paper, we study probabilistic modeling of heterogeneously attributed multi-dimensional arrays. The model can manage the heterogeneity by employing an individual exponential-family distribution for each attribute of the tensor array. These entries are connected by latent variables and are shared information across the different attributes. Because a Bayesian inference for our model is intractable, we cast the EM algorithm approximated by using the Lap lace method and Gaussian process. This approximation enables us to derive a predictive distribution for missing values in a consistent manner. Simulation experiments show that our method outperforms other methods such as PARAFAC and Tucker decomposition in missing-values prediction for cross-national statistics and is also applicable to discover anomalies in heterogeneous office-logging data.
Keywords :
Bayes methods; Gaussian processes; Laplace equations; belief networks; expectation-maximisation algorithm; inference mechanisms; matrix decomposition; prediction theory; sensor fusion; Bayesian inference; Gaussian process; Laplace method; anomaly detection; cross national statistic; exponential family tensor factorization; missing values prediction; multidimensional array; office logging data; probabilistic modeling; Bayesian probabilistic model; Gaussian process; data fusion; tensor factorization;
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2010.39