Beginning with big data simplified

Author

Bedi, Punam ; Jindal, Vinita ; Gautam, Anjali

Author_Institution

Dept. of Comput. Sci., Univ. of Delhi, New Delhi, India

fYear

2014

fDate

5-6 Sept. 2014

Firstpage

1

Lastpage

7

Abstract

Big Data is a collection of datasets containing massive amounts of data, in the range of zettabytes and yottabytes. Organizations face difficulties in manipulating and managing this data because traditional database and software techniques are unable to process and analyze such volumes. Dealing with Big Data requires new tools and techniques that can extract valuable information through an analytic process. Volume, Variety, Velocity, Value, Veracity, Variability and Complexity are attributes associated with Big Data in the literature. In this paper, we briefly describe these existing attributes and propose adding Viability, Cost and Consistency as new attributes to this set. This paper also discusses existing tools and techniques associated with Big Data. Fleet management, an evolving application of GPS data, is taken as a case study to illustrate the various attributes of Big Data. The paper also presents an implementation of a sorting problem on GPS data, run on Hadoop clusters of varying sizes.

Keywords

Big Data; Global Positioning System; distributed processing; geographic information systems; GPS data; Hadoop cluster; analytic process; fleet management; software techniques; traditional database; valuable information; Business; Databases; Real-time systems; Sorting; Vehicles; C's of Big Data; Hadoop; Map Reduce; V's of Big Data

fLanguage

English

Publisher

IEEE

Conference_Title

Data Mining and Intelligent Computing (ICDMIC), 2014 International Conference on

Conference_Location

New Delhi

Print_ISBN

978-1-4799-4675-4

Type

conf

DOI

10.1109/ICDMIC.2014.6954229

Filename

6954229