DocumentCode
3051157
Title
Asynchronous Index Strategy for high performance real-time big data stream storage
Author
Xiao Mo ; Hao Wang
Author_Institution
Pattern Reconigtion & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2012
fDate
21-23 Sept. 2012
Firstpage
232
Lastpage
236
Abstract
Big data insert-intensive applications challenge traditional RDBMS. Key-Value databases achieve the same throughput with much more price/performance ratio, which makes them popular recent years. However, Key-Value databases are not suitable for high performance real-time applications. In this paper we introduce Asynchronous Index Strategy as a high performance solution for insert-intensive time series big data storage. It takes advantage of partial replication and asynchronous indexes, which results in zero overhead for index updates. Furthermore, a general middle-ware for clustering databases based on Asynchronous Index Strategy is implemented. Finally, indexing and inserting performance experiments highlight the efficiency of Asynchronous Index Strategy. As for AIS based on MongoDB, it achieves a throughput that is 17 times of MongoDB sharding cluster.
Keywords
indexing; middleware; pattern clustering; relational databases; AIS; MongoDB sharding cluster; RDBMS; asynchronous index strategy; database clustering; general middleware; high performance real-time big data stream storage; insert-intensive time series big data storage; key-value databases; partial replication; price-performance ratio; relational database management system; zero overhead; Availability; Data handling; Data storage systems; Indexes; Information management; Throughput; Big data; Database clustering; Insert-intensive;
fLanguage
English
Publisher
ieee
Conference_Titel
Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4673-2201-0
Type
conf
DOI
10.1109/ICNIDC.2012.6418750
Filename
6418750
Link To Document