DocumentCode
1791756
Title
Predicting flight arrival times with a multistage model
Author
Takacs, Gabor
Author_Institution
Dept. of Math. & Comput. Sci., Szechenyi Istvan Univ., Györ, Hungary
fYear
2014
fDate
27-30 Oct. 2014
Firstpage
78
Lastpage
84
Abstract
Airlines are constantly looking for ways to cut flight delays, in order to enhance service quality and reduce operational costs. The goal of the data science contest, GE Flight Quest (https://www.gequest.com/c/flight), was to make flights more efficient by improving the accuracy of arrival time estimates. The data set of the contest was 128 GB in size and contained 252 data columns arranged in 34 tables. This paper presents my solution that won third prize under team name Taki. The solution employs a 6-stage model consisting of successive ridge regressions and gradient boosting machines, built on 56 features constructed from the raw data. The hardware environment used for training and running the model was a 64 core machine with 1 terabyte of memory.
Keywords
Big Data; air traffic; data analysis; learning (artificial intelligence); regression analysis; traffic engineering computing; airlines; flight arrival times prediction; gradient boosting machines; multistage model; real-time big data analysis; ridge regressions; Airports; Atmospheric modeling; Delays; Logic gates; Meteorology; Predictive models; Training; GE Flight Quest; gradient boosting machine; parallelization; ridge regression;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location
Washington, DC
Type
conf
DOI
10.1109/BigData.2014.7004435
Filename
7004435
Link To Document