DocumentCode
2025192
Title
Reduce, You Say: What NoSQL Can Do for Data Aggregation and BI in Large Repositories
Author
Bonnet, Laurent ; Laurent, Anne ; Sala, Michel ; Laurent, Bénédicte ; Sicard, Nicolas
Author_Institution
LIRMM, Univ. Montpellier 2, Montpellier, France
fYear
2011
fDate
Aug. 29 2011-Sept. 2 2011
Firstpage
483
Lastpage
488
Abstract
Data aggregation is one of the key features used in databases, especially for Business Intelligence (e.g., ETL, OLAP) and analytics/data mining. When considering SQL databases, aggregation is used to prepare and visualize data for deeper analyses. However, these operations are often impossible on very large volumes of data regarding memory-and-time-consumption. In this paper, we show how NoSQL databases such as MongoDB and its key-value stores, thanks to the native MapReduce algorithm, can provide an efficient framework to aggregate large volumes of data. We provide basic material about the MapReduce algorithm, the different NoSQL databases (read intensive vs. write intensive). We investigate how to efficiently modelize the data framework for BI and analytics. For this purpose, we focus on read intensive NoSQL databases using MongoDB and we show how NoSQL and MapReduce can help handling large volumes of data.
Keywords
SQL; competitive intelligence; data mining; MapReduce algorithm; MongoDB; NoSQL databases; business intelligence; data aggregation; data mining; large repositories; memory-and-time-consumption; Aggregates; Arrays; Availability; Data models; Database systems; Distributed databases; Data Aggregation; MapReduce; Massive Data Sets; Mon-goDB; NoSQL; Read Intensive; SQL; Write Intensive;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications (DEXA), 2011 22nd International Workshop on
Conference_Location
Toulouse
ISSN
1529-4188
Print_ISBN
978-1-4577-0982-1
Type
conf
DOI
10.1109/DEXA.2011.71
Filename
6059864
Link To Document