Title :
Big Data Analysis with Signal Processing on Graphs: Representation and processing of massive data sets with irregular structure
Author :
Sandryhaila, Aliaksei ; Moura, Jose M. F.
Author_Institution :
Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Analysis and processing of very large data sets, or big data, poses a significant challenge. Massive data sets are collected and studied in numerous domains, from engineering sciences to social networks, biomolecular research, commerce, and security. Extracting valuable information from big data requires innovative approaches that efficiently process large amounts of data as well as handle and, moreover, utilize their structure. This article discusses a paradigm for large-scale data analysis based on the discrete signal processing (DSP) on graphs (DSPG). DSPG extends signal processing concepts and methodologies from the classical signal processing theory to data indexed by general graphs. Big data analysis presents several challenges to DSPG, in particular, in filtering and frequency analysis of very large data sets. We review fundamental concepts of DSPG, including graph signals and graph filters, graph Fourier transform, graph frequency, and spectrum ordering, and compare them with their counterparts from the classical signal processing theory. We then consider product graphs as a graph model that helps extend the application of DSPG methods to large data sets through efficient implementation based on parallelization and vectorization. We relate the presented framework to existing methods for large-scale data processing and illustrate it with an application to data compression.
Keywords :
Big Data; Fourier transforms; data compression; data structures; graph theory; vectors; Big Data analysis; DSPG; data compression; data processing; data representation; data structure; discrete signal processing on graphs; graph Fourier transform; graph filters; graph frequency; graph signals; parallelization; spectrum ordering; vectorization; Big data; Data storage; Digital signal processing; Fourier transforms; Graph theory; Information analysis; Information processing; Time series analysis;
Journal_Title :
Signal Processing Magazine, IEEE
DOI :
10.1109/MSP.2014.2329213