Abstract:
The First International Symposium on Big Data and MapReduce (BigDataMR2012) is co-located with the Second International Conference on Cloud and Green Computing (CGC2012), held on November 1-3, 2012, in Xiangtan, Hunan, China. Big data is an emerging paradigm applied to datasets whose size is beyond the ability of commonly used software tools to capture, manage, and process within a tolerable elapsed time. Such datasets often come from various sources (Variety) and are frequently unstructured, such as social media, sensors, scientific applications, surveillance, video and image archives, Internet texts and documents, Internet search indexing, medical records, business transactions, and web logs; they are large in size (Volume) and have fast data in/out (Velocity). Various technologies are being discussed to support the handling of big data, such as massively parallel processing databases, scalable storage systems, cloud computing platforms, and MapReduce. MapReduce is a distributed programming paradigm and an associated implementation that supports distributed computing over large datasets in the cloud. This symposium aims to provide a forum for researchers, practitioners, and developers from different backgrounds, such as cloud computing, distributed computing, and databases, to exchange the latest experience, research ideas, and synergistic research and development on fundamental issues and applications of big data and MapReduce in cloud environments. BigDataMR2012 contains 9 papers, each of which was peer reviewed by at least three program committee members. The symposium covers a broad range of topics in the field of big data and MapReduce.