Title :
Towards methods for systematic research on big data
Author :
Manirupa Das;Renhao Cui;David R. Campbell;Gagan Agrawal;Rajiv Ramnath
Author_Institution :
Department of Computer Science and Engineering, The Ohio State University
Abstract :
Big Data is characterized by the five V´s - of Volume, Velocity, Variety, Veracity and Value. Research on Big Data, that is, the practice of gaining insights from it, challenges the intellectual, process, and computational limits of an enterprise. Leveraging the correct and appropriate toolset requires careful consideration of a large software ecosystem. Powerful algorithms exist, but the exploratory and often ad-hoc nature of analytic demands and a distinct lack of established processes and methodologies make it difficult for Big Data teams to set expectations or even create valid project plans. The exponential growth of data generated exceeds the capacity of humans to process it, and compels us to develop automated computing methods that require significant and expensive computing power in order to scale effectively. In this paper, we characterize data-driven practice and research and explore how we might design effective methods for systematizing such practice and research [19, 22]. Brief case studies are presented in order to ground our conclusions and insights.
Keywords :
"Blogs","Big data","Data mining","Predictive models","Feature extraction","Distributed databases"
Conference_Titel :
Big Data (Big Data), 2015 IEEE International Conference on
DOI :
10.1109/BigData.2015.7363989