DocumentCode
3717265
Title
PAIRS: A scalable geo-spatial data analytics platform
Author
Levente J Klein;Fernando J Marianno;Conrad M Albrecht;Marcus Freitag;Siyuan Lu;Nigel Hinds;Xiaoyan Shao;Sergio Bermudez Rodriguez;Hendrik F Hamann
Author_Institution
IBM TJ Watson Research Center Yorktown Heights, NY 10598
fYear
2015
Firstpage
1290
Lastpage
1298
Abstract
Geospatial data volume exceeds hundreds of Petabytes and is increasing exponentially mainly driven by images/videos/data generated by mobile devices and high resolution imaging systems. Fast data discovery on historical archives and/or real time datasets is currently limited by various data formats that have different projections and spatial resolution, requiring extensive data processing before analytics can be carried out. A new platform called Physical Analytics Integrated Repository and Services (PAIRS) is presented that enables rapid data discovery by automatically updating, joining, and homogenizing data layers in space and time. Built on top of open source big data software, PAIRS manages automatic data download, data curation, and scalable storage while being simultaneously a computational platform for running physical and statistical models on the curated datasets. By addressing data curation before data being uploaded to the platform, multi-layer queries and filtering can be performed in real time. In addition, PAIRS offers a foundation for developing custom analytics. Towards that end we present two examples with models which are running operationally: (1) high resolution evapo-transpiration and vegetation monitoring for agriculture and (2) hyperlocal weather forecasting driven by machine learning for renewable energy forecasting.
Keywords
"Geospatial analysis","Satellites","Spatial resolution","Real-time systems","Data models","Weather forecasting"
Publisher
ieee
Conference_Titel
Big Data (Big Data), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/BigData.2015.7363884
Filename
7363884
Link To Document