Title :
Data Mining to Classify Fog Events by Applying Cost-Sensitive Classifier
Author :
Zazzaro, Gaetano ; Pisano, Francesca Maria ; Mercogliano, Paola
Author_Institution :
Software Technol. Lab., CIRA (Italian Aerosp. Res. Centre), Capua, Italy
Abstract :
This work illustrates a Data Mining application in the meteorological domain: the creation of local fog-classifying indices based on the post-processing of meteorological variables. The dataset contains 17396 records collected at the Trapani Milo station, of which only 142 are fog events. Such a rare event required specific Data Mining approaches to overcome the class imbalance problem, here a Cost Sensitive Classifier (combined with a Bayes Network). The results were evaluated with performance metrics that highlight the classifying ability of an index on fog events and no-fog events separately (confusion matrix, ROC, AUC, ...). The models were tested on 4349 records; four models exceeded the AUC threshold of 0.8 and, for one of them, the ROC curve showed a good result: 88% of fog events correctly predicted.
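The cost-sensitive idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes a base classifier (such as a Bayes network) already supplies a posterior probability of fog for each record, and it picks the label with the lower expected misclassification cost. The cost values, the toy probabilities, and all function names are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
def cost_sensitive_label(p_fog, cost_fn=10.0, cost_fp=1.0):
    """Return 1 (fog) when predicting fog has the lower expected cost.

    cost_fn: assumed cost of missing a fog event (false negative).
    cost_fp: assumed cost of a false fog alarm (false positive).
    """
    expected_cost_if_no_fog = p_fog * cost_fn        # risk of a missed fog event
    expected_cost_if_fog = (1.0 - p_fog) * cost_fp   # risk of a false alarm
    return 1 if expected_cost_if_fog < expected_cost_if_no_fog else 0


def confusion_counts(y_true, y_pred):
    """Count (tp, fn, fp, tn) with fog = 1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fn, fp, tn


def per_class_recall(tp, fn, fp, tn):
    """Fraction of fog and of no-fog events classified correctly."""
    fog_recall = tp / (tp + fn) if tp + fn else 0.0
    no_fog_recall = tn / (tn + fp) if tn + fp else 0.0
    return fog_recall, no_fog_recall


# Toy, made-up data: 2 fog events among 8 records.
y_true = [1, 1, 0, 0, 0, 0, 0, 0]
p_fog = [0.30, 0.05, 0.20, 0.02, 0.01, 0.08, 0.03, 0.04]

y_pred = [cost_sensitive_label(p) for p in p_fog]
counts = confusion_counts(y_true, y_pred)
fog_recall, no_fog_recall = per_class_recall(*counts)
```

With these assumed costs the rule is equivalent to predicting fog whenever the posterior exceeds cost_fp / (cost_fp + cost_fn), which is about 0.09 instead of the usual 0.5. This is how a rare class can still be predicted, and reporting the recall of each class separately shows what that lower threshold costs in false alarms.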
Keywords :
belief networks; data mining; fog; geophysics computing; meteorology; pattern classification; Bayes network; cost-sensitive classifier; data mining; fog events classification; meteorological domain; no-fog events classification; Application software; Bayesian methods; Competitive intelligence; Costs; Data mining; Measurement; Meteorology; Predictive models; Software systems; Weather forecasting; Bayesian Classifiers; Cost Sensitive Classification; Data Mining; Fog Forecast;
Conference_Title :
Complex, Intelligent and Software Intensive Systems (CISIS), 2010 International Conference on
Conference_Location :
Krakow
Print_ISBN :
978-1-4244-5917-9
DOI :
10.1109/CISIS.2010.233