DocumentCode
151473
Title
Beginning with big data simplified
Author
Bedi, Punam ; Jindal, Vinita ; Gautam, Anjali
Author_Institution
Dept. of Comput. Sci., Univ. of Delhi, New Delhi, India
fYear
2014
fDate
5-6 Sept. 2014
Firstpage
1
Lastpage
7
Abstract
Big Data is a collection of datasets containing massive amount of data in the range of zettabytes and yottabytes. Organizations are facing difficulties in manipulating and managing this massive data as existing traditional database and software techniques are unable to process and analyze voluminous data. Dealing with Big Data requires new tools and techniques that can extract valuable information using some analytic process. Volume, Variety, Velocity, Value, Veracity, Variability and Complexity are attributes associated with Big Data in various works in the literature. In this paper, we briefly describe these existing attributes and also propose to add Viability, Cost and Consistency as new attributes to this set. This paper also discusses existing tools and techniques associated with Big Data. Fleet management is an evolving application of GPS data. It is taken as a case study in this work to illustrate various attributes of Big Data. This paper also presents the implementation of a sorting problem by varying Hadoop cluster sizes for the GPS data.
Keywords
Big Data; Global Positioning System; distributed processing; geographic information systems; GPS data; Hadoop cluster; analytic process; big data; fleet management; software techniques; traditional database; valuable information; Big data; Business; Databases; Global Positioning System; Real-time systems; Sorting; Vehicles; Big Data; C´s of Big Data; GPS data; Hadoop; Map Reduce; V´s of Big Data;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining and Intelligent Computing (ICDMIC), 2014 International Conference on
Conference_Location
New Delhi
Print_ISBN
978-1-4799-4675-4
Type
conf
DOI
10.1109/ICDMIC.2014.6954229
Filename
6954229
Link To Document