DocumentCode :
3453127
Title :
Improving access to multi-dimensional self-describing scientific datasets
Author :
Nam, Beomseok ; Sussman, Alan
Author_Institution :
Dept. of Comput. Sci., Univ. of Maryland, College Park, MD, USA
fYear :
2003
fDate :
12-15 May 2003
Firstpage :
172
Lastpage :
179
Abstract :
Applications that query into very large multidimensional datasets are becoming more common. Many self-describing scientific data file formats have also emerged, which have structural metadata to help navigate the multi-dimensional arrays that are stored in the files. The files may also contain application-specific semantic metadata. In this paper, we discuss efficient methods for performing searches for subsets of multi-dimensional data objects, using semantic information to build multidimensional indexes, and group data items into properly sized chunks to maximize disk I/O bandwidth. This work is the first step in the design and implementation of a generic indexing library that will work with various high-dimension scientific data file formats containing semantic information about the stored data. To validate the approach, we have implemented indexing structures for NASA remote sensing data stored in the HDF format with a specific schema (HDF-EOS), and show the performance improvements that are gained from indexing the datasets, compared to using the existing HDF library for accessing the data.
Keywords :
database indexing; distributed databases; meta data; query formulation; very large databases; NASA remote sensing data; application-specific semantic metadata; disk I/O bandwidth; indexing structures; multidimensional arrays; multidimensional datasets; self-describing scientific data file formats; structural metadata; Application software; Bandwidth; Computer science; Educational institutions; Indexing; Libraries; Middleware; Multidimensional systems; NASA; Navigation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing and the Grid, 2003. Proceedings. CCGrid 2003. 3rd IEEE/ACM International Symposium on
Print_ISBN :
0-7695-1919-9
Type :
conf
DOI :
10.1109/CCGRID.2003.1199366
Filename :
1199366
Link To Document :
بازگشت