Title :
Pigeon: A spatial MapReduce language
Author :
Eldawy, Ahmed ; Mokbel, Mohamed F.
Author_Institution :
Comput. Sci. & Eng., Univ. of Minnesota, Minneapolis, MN, USA
Date :
March 31-April 4, 2014
Abstract :
With the huge amounts of spatial data collected every day, MapReduce frameworks such as Hadoop have become a common choice for scientists and industry practitioners who analyze big spatial data. For simplicity, users prefer to work with Hadoop through high-level languages such as Pig Latin. Unfortunately, these languages are designed for primitive non-spatial data and have no support for spatial data types or functions. This demonstration presents Pigeon, a spatial extension to Pig that provides spatial functionality. Pigeon is implemented through user-defined functions (UDFs), which makes it easy to use and compatible with all recent versions of Pig. This also allows it to integrate smoothly with existing non-spatial functions and operations such as Filter, Join, and Group By. Pigeon is compatible with the Open Geospatial Consortium (OGC) standard, which makes it easy to learn and use for users who are familiar with existing OGC-compliant tools such as PostGIS. This demonstration shows the audience how to work with Pigeon through several applications running on large-scale real datasets extracted from OpenStreetMap.
Keywords :
Big Data; data analysis; high level languages; standards; visual databases; Hadoop; OGC standard; OGC-compliant tools; Open Geospatial Consortium standard; OpenStreetMap; Pig Latin; Pigeon; PostGIS; UDF; big spatial data analysis; filter; group by; join; spatial MapReduce language; user defined functions; Cities and towns; Lakes; Rivers; Roads; Spatial databases; Standards; XML
Conference_Title :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
DOI :
10.1109/ICDE.2014.6816751