DocumentCode :
2000094
Title :
Data Evolution Analysis of Virtual DataSpace for Managing the Big Data Lifecycle
Author :
Xin Cheng ; Chungjin Hu ; Yang Li ; Wei Lin ; Haolei Zuo
Author_Institution :
Sch. of Comput. & Commun. Eng., Univ. of Sci. & Technol. Beijing, Beijing, China
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
2054
Lastpage :
2063
Abstract :
New challenge about the constantly changing of associated data in big data management has arisen, which leads to the issue of data evolution. In this paper, a data evolution model of Virtual Data Space (VDS) is proposed for managing the big data lifecycle. Firstly, the concept of data evolution cycle is defined, and the lifecycle process of big data management is described. Based on these, the data evolution lifecycle is analyzed from the data relationship, the user requirements, and the operation behavior. Secondly, the classification and key concepts about the data evolution process are described in detail. According to this, the data evolution model is constructed by defining the related concepts and analyzing the data association in VDS, for the capture and tracking of dynamic data in the data evolution cycle. Then we discuss the cost problem about data dissemination and change. Finally, as the application case, the service process of dynamic data in the field of materials science is described and analyzed. We verify the validity of data evolution modeling in VDS by the comparison of traditional database, data space, and VDS. It shows that this analysis method is efficient for the data evolution processing, and very suitable for the data-intensive application and the real-time dynamic service.
Keywords :
data analysis; information dissemination; materials science computing; sensor fusion; VDS; Virtual DataSpace; big data lifecycle process management; data association; data dissemination; data evolution cycle; data evolution model; data evolution process; data evolution processing; data relationship; data-intensive application; dynamic data capture; dynamic data tracking; materials science; operation behavior; real-time dynamic service; user requirements; Analytical models; Data handling; Data models; Data storage systems; Distributed databases; Information management; Semantics; Big Data; Data Evolution; Lifecycle; Virtual DataSpace (VDS);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.57
Filename :
6651110
Link To Document :
بازگشت