Title :
HCTrie: A structure for indexing hundreds of dimensions for use in file systems search
Author_Institution :
Storage Syst. Res. Center & Comput. Sci. Dept., Univ. of California, Santa Cruz, Santa Cruz, CA, USA
Abstract :
Data management in large-scale storage systems involves indexing and searching for data objects (e.g., files). Hundreds of types of metadata are attributed to these data objects; examples include the environmental settings of photograph files and the simulation configurations of simulation output files. To provide intelligent file search that uses file metadata, we introduce a novel search structure called Hyper-Cube Trie (HCTrie) that can handle a few hundred dimensions of data attributes. HCTrie uses differences across many dimensions effectively: candidates can be pruned based on differences in all dimensions. To the best of our knowledge, this is the first approach that restrains memory growth to a linear scale in the number of dimensions when multiple dimensions are indexed at the same time. Our prototype has successfully indexed five million data entries with one hundred attributes in a single data structure. We show that HCTrie can outperform MySQL in range search when the search query specifies ranges for fewer than 100 dimensions.
Keywords :
SQL; database indexing; meta data; search problems; storage management; tree data structures; HCTrie; MySQL; data attributes; data entry; data management; data objects indexing; data objects search; data structure; environmental settings; file metadata; file systems search; hyper-cube trie; intelligent file search; large-scale storage systems; linear scale; memory growth; multiple dimensions; photograph files; search query; search structure; simulation configurations; simulation output files; Arrays; Complexity theory; Hypercubes; Indexing; Prototypes;
Conference_Title :
Mass Storage Systems and Technologies (MSST), 2013 IEEE 29th Symposium on
Conference_Location :
Long Beach, CA
Print_ISBN :
978-1-4799-0217-0
DOI :
10.1109/MSST.2013.6558431