Title :
Distributed large-scale tensor decomposition
Author :
de Almeida, Andre L. F. ; Kibangou, Alain Y.
Author_Institution :
Dept. of Teleinformatics Eng., Fed. Univ. of Ceara, Fortaleza, Brazil
Abstract :
Canonical Polyadic Decomposition (CPD), also known as PARAFAC, is a useful tool for tensor factorization. It has found application in several domains including signal processing and data mining. With the deluge of data faced in our societies, large-scale matrix and tensor factorizations become a crucial issue. Few works have been devoted to large-scale tensor factorizations. In this paper, we introduce a fully distributed method to compute the CPD of a large-scale data tensor across a network of machines with limited computation resources. The proposed approach is based on collaboration between the machines in the network across the three modes of the data tensor. Such a multi-modal collaboration allows an essentially unique reconstruction of the factor matrices in an efficient way. We provide an analysis of the computation and communication cost of the proposed scheme and address the problem of minimizing communication costs while maximizing the use of available computation resources.
Keywords :
data handling; distributed processing; matrix decomposition; tensors; CPD; PARAFAC; canonical polyadic decomposition; communication cost minimization; computation resources; data mining; distributed computation; distributed large-scale tensor decomposition; factor matrices; large-scale data tensor; large-scale matrix; large-scale tensor factorizations; multimodal collaboration; signal processing; Algorithm design and analysis; Collaboration; Data mining; Matrix decomposition; Servers; Tensile stress; Topology; Tensor decompositions; distributed computation; large-scale data;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853551