• DocumentCode
    3720035
  • Title

    Prediction of influenza outbreaks by integrating Wikipedia article access logs and Google flu trend data

  • Author

    Batuhan Bardak;Mehmet Tan

  • Author_Institution
    Department of Computer Engineering, TOBB University of Economics and Technology, 06510 Ankara, Turkey
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Prediction of influenza outbreaks is of utmost importance for health practitioners, officers and people. After the increasing usage of internet, it became easier and more valuable to fetch and process internet search query data. There are two significant platforms that people widely use, Google and Wikipedia. In both platforms, access logs are available which means that we can see how often any query/article was searched. Google has its own web service for monitoring and forecasting influenza-illness which is called the Google Flu Trends. It provides estimates of influenza activity for some countries. The second alternative is Wikipedia access logs which provide the number of visits for the articles on Wikipedia. There are papers which work with these platforms separately. In this paper, we propose a new technique to use these two sources together to improve the prediction of influenza outbreaks. We achieved promising results for both nowcasting and forecasting with linear regression models.
  • Keywords
    "Internet","Encyclopedias","Electronic publishing","Google","Market research","Predictive models"
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2015 IEEE 15th International Conference on
  • Type

    conf

  • DOI
    10.1109/BIBE.2015.7367640
  • Filename
    7367640