DocumentCode
3730563
Title
Prediction of influenza epidemics at the province level in China using search query from “Haosou”
Author
Yuanqiang Zou;Yousong Peng; Li Lu;Taijiao Jiang; Lizong Deng
Author_Institution
College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
fYear
2015
Firstpage
1450
Lastpage
1454
Abstract
Influenza (Flu) has caused and will continue to cause large morbidity and mortality to human society. Surveillance is critical for prevention and control of it. Traditional methods for influenza surveillance are based on hierarchical surveillance networks, which are costly and time-consuming. The Internet-based method is a useful supplement to traditional methods in influenza surveillance. Here, we attempted to predict the daily influenza epidemics, represented as the number of influenza-like illness (ILI) cases, in three representative provinces in China (Beijing, Shanghai and Guangdong) based on the “Haosou” search engine. The search queries for several keywords related to influenza were observed to have medium to weak correlations to influenza epidemics in these provinces. Two kinds of statistical model, the multiple linear regression and regression tree, were built to predict the influenza epidemics based on the search queries of these keywords. Both of them achieved moderate performance in the cross-validation and retrospective-testing for each province, with the Pearson Correlation Coefficients between the predicted and observed influenza epidemics ranging from 0.35 to 0.78. Further, the unified models were built for prediction of influenza epidemics for all three provinces. Their poor performances suggest the difficulty of building a unified model for prediction of influenza epidemics for all provinces in China. Overall, this work shows the potential of predicting the daily influenza epidemics at the province level in China based on the search engine data. More efforts are needed to improve the statistical models for their use in practical influenza surveillance.
Keywords
"Influenza","Correlation","Surveillance","Search engines","Predictive models","Internet","Linear regression"
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type
conf
DOI
10.1109/FSKD.2015.7382157
Filename
7382157
Link To Document