DocumentCode :
3069446
Title :
A study into the capabilities of NoSQL databases in handling a highly heterogeneous tree
Author :
Jayathilake, D. ; Sooriaarachchi, C. ; Gunawardena, T. ; Kulasuriya, B. ; Dayaratne, T.
Author_Institution :
99X Technol., Colombo, Sri Lanka
fYear :
2012
fDate :
27-29 Sept. 2012
Firstpage :
106
Lastpage :
111
Abstract :
This paper presents our work on assessing the feasibility of using different NoSQL databases to handle a huge tree data structure with heterogeneous nodes, where heterogeneity means that each node can carry a unique attribute set. This is a prominent requirement in structured log analysis, where the constituents of a software log file are examined hierarchically. Traditional relational databases fail to handle this efficiently. We lean towards the NoSQL paradigm, which has been emerging as a prominent solution for dealing with high volumes of data with localized characteristics. Our exploration probes five different NoSQL models: wide column store, document store, tuple store, graph databases and multi-model databases, which collectively account for a large fraction of the entire NoSQL spectrum. An experiment is designed to measure database performance against a generic tree API, focusing on node insertion, node query and attribute-value query. The API is then implemented in one database selected from each of the five NoSQL models. The implementations are used to test database performance on the three operations by measuring the time taken for a batch of similar operations on a machine with an average hardware and software configuration. A summary of the experimental results is provided, along with details of the tree implementation methodology in each database. A discussion highlighting the congruence between the observed performance differences among the databases and the theoretical NoSQL models they represent is also included.
Keywords :
SQL; application program interfaces; distributed databases; tree data structures; trees (mathematics); NoSQL databases; NoSQL spectrum; attribute set; attribute-value query; database performance testing; document store; generic tree API; graph databases; hardware configuration; heterogeneous nodes; highly heterogeneous tree handling; huge tree data structure handling; localized characteristics; multimodel databases; node insertion; node query; software configuration; software log file; structured log analysis; tuple store; wide column store; Arrays; Distributed databases; Indexing; Servers; Time measurement;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information and Automation for Sustainability (ICIAfS), 2012 IEEE 6th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-1976-8
Type :
conf
DOI :
10.1109/ICIAFS.2012.6419890
Filename :
6419890