DocumentCode :
1791756
Title :
Predicting flight arrival times with a multistage model
Author :
Takacs, Gabor
Author_Institution :
Dept. of Math. & Comput. Sci., Szechenyi Istvan Univ., Györ, Hungary
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
78
Lastpage :
84
Abstract :
Airlines are constantly looking for ways to cut flight delays, in order to enhance service quality and reduce operational costs. The goal of the data science contest, GE Flight Quest (https://www.gequest.com/c/flight), was to make flights more efficient by improving the accuracy of arrival time estimates. The data set of the contest was 128 GB in size and contained 252 data columns arranged in 34 tables. This paper presents my solution that won third prize under team name Taki. The solution employs a 6-stage model consisting of successive ridge regressions and gradient boosting machines, built on 56 features constructed from the raw data. The hardware environment used for training and running the model was a 64 core machine with 1 terabyte of memory.
Keywords :
Big Data; air traffic; data analysis; learning (artificial intelligence); regression analysis; traffic engineering computing; airlines; flight arrival times prediction; gradient boosting machines; multistage model; real-time big data analysis; ridge regressions; Airports; Atmospheric modeling; Delays; Logic gates; Meteorology; Predictive models; Training; GE Flight Quest; gradient boosting machine; parallelization; ridge regression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004435
Filename :
7004435
Link To Document :
بازگشت