• DocumentCode
    2193108
  • Title

    Efficient techniques for range search queries on earth science data

  • Author

    Shi, Qingmin ; JaJa, Joseph F.

  • Author_Institution
    Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    142
  • Lastpage
    151
  • Abstract
    We consider the problem of organizing large scale earth science raster data to efficiently handle queries for identifying regions whose parameters fall within certain range values specified by the queries. This problem seems to be critical to enabling basic data mining tasks such as determining associations between physical phenomena and spatial factors, detecting changes and trends, and content based retrieval. We assume that the input is too large to fit in internal memory and hence focus on data structures and algorithms that minimize the I/O bounds. A new data structure, called a tree-of-regions (ToR), is introduced and involves a combination of an R-tree and efficient representation of regions. It is shown that such a data structure enables the handling of range queries in an optimal I/O time, under certain reasonable assumptions. We also show that updates to the ToR can be handled efficiently. Experimental results for a variety of multi-valued earth science data illustrate the fast execution times of a wide range of queries, as predicted by our theoretical analysis.
  • Keywords
    data mining; natural sciences computing; query processing; temporal databases; tree data structures; visual databases; content based retrieval; data mining tasks; data structures; earth science data; large scale raster data; range queries; range search queries; spatial factors; tree-of-regions; Computational modeling; Content based retrieval; Data mining; Data structures; Educational institutions; Geoscience; Information retrieval; Large-scale systems; Organizing; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2002. Proceedings. 14th International Conference on
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-1632-7
  • Type

    conf

  • DOI
    10.1109/SSDM.2002.1029714
  • Filename
    1029714