Title :
Stratification driven placement of complex data: A framework for distributed data analytics
Author :
Ye Wang ; Parthasarathy, Srinivasan ; Sadayappan, P.
Author_Institution :
Comput. Sci. & Eng. Dept., Ohio State Univ., Columbus, OH, USA
Abstract :
With the increasing popularity of XML data stores, social networks and Web 2.0 and 3.0 applications, complex data formats, such as trees and graphs, are becoming ubiquitous. Managing and processing such large and complex data stores, on modern computational eco-systems, to realize actionable information efficiently, is an important challenge. A critical element at the heart of this challenge relates to the placement, storage and access of such tera- and peta- scale data. In this work we develop a novel distributed framework to ease the burden on the programmer and propose an agile and intelligent placement service layer as a flexible yet unified means to address this challenge. Central to our framework is the notion of stratification which seeks to initially group structurally (or semantically) similar entities into strata. Subsequently strata are partitioned within this ecosystem according to the needs of the application to maximize locality, balance load, or minimize data skew. Results on several real-world applications validate the efficacy and efficiency of our approach.
Keywords :
Internet; XML; data analysis; ecology; resource allocation; social networking (online); Web 2.0 applications; Web 3.0 applications; XML data stores; balance load; complex data formats; complex data storage; computational ecosystems; critical element; data skew minimization; distributed data analytics; distributed framework; intelligent placement service layer; locality maximization; petascale data; real-world applications; social networks; stratification driven placement; terascale data; Clustering algorithms; Communities; Distributed databases; Social network services; Sorting; XML;
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
DOI :
10.1109/ICDE.2013.6544868