DocumentCode :
1932058
Title :
Assessing Data Virtualization for Irregularly Replicated Large Datasets
Author :
Diniz, Bruno ; Nogueira, Diêgo L. ; Cardoso, Andre ; Ferreira, Renato A. ; Guedes, Dorgival ; Meira, Wagner, Jr.
Author_Institution :
Federal University of Minas Gerais, Brazil
Volume :
1
fYear :
2006
fDate :
16-19 May 2006
Firstpage :
505
Lastpage :
512
Abstract :
Large volumes of data are generated every day by experiments, simulations and all sorts of applications. It is common to observe situations where portions of data are irregularly replicated and distributed in different data sources. It would be desirable to be able to handle these several pieces of irregular data (replicated or not) as a single large dataset. This is called data virtualization and is the focus of this paper. In this paper, we present a system which is capable of dealing with irregularly replicated data and is able to create a virtual view of the union of the individual irregular portions of data hosted by each data source. Our system indexes the data intervals from each data source and allows clients to submit queries against the virtual dataset created. In order to select which server will be responsible for each data interval of a query, we use and compare three algorithms, namely Random, Round-Robin and Weighted Round-Robin. The comparison is driven by simulation and the parameters for the simulation are all taken from a real data-centered application (the Virtual Microscope).
Keywords :
Application software; Computational modeling; Computer science; Computer simulation; Extraterrestrial phenomena; Grid computing; Microscopy; Physics; Query processing; Round robin
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cluster Computing and the Grid, 2006. CCGRID 06. Sixth IEEE International Symposium on
Conference_Location :
Singapore
Print_ISBN :
0-7695-2585-7
Type :
conf
DOI :
10.1109/CCGRID.2006.21
Filename :
1630863