Title of article :
Twitter mining for fine-grained syndromic surveillance
Author/Authors :
Velardi، نويسنده , , Paola and Stilo، نويسنده , , Giovanni and Tozzi، نويسنده , , Alberto E. and Gesualdo، نويسنده , , Francesco، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2014
Pages :
11
From page :
153
To page :
163
Abstract :
AbstractBackground l traces left on the Internet by web users, if properly aggregated and analyzed, can represent a huge information dataset able to inform syndromic surveillance systems in real time with data collected directly from individuals. Since people use everyday language rather than medical jargon (e.g. runny nose vs. respiratory distress), knowledge of patients’ terminology is essential for the mining of health related conversations on social networks. ives s paper we present a methodology for early detection and analysis of epidemics based on mining Twitter messages. In order to reliably trace messages of patients that actually complain of a disease, first, we learn a model of naïve medical language, second, we adopt a symptom-driven, rather than disease-driven, keyword analysis. This approach represents a major innovation compared to previous published work in the field. st developed an algorithm to automatically learn a variety of expressions that people use to describe their health conditions, thus improving our ability to detect health-related “concepts” expressed in non-medical terms and, in the end, producing a larger body of evidence. We then implemented a Twitter monitoring instrument to finely analyze the presence and combinations of symptoms in tweets. s st evaluate the algorithmʹs performance on an available dataset of diverse medical condition synonyms, then, we assess its utility in a case study of five common syndromes for surveillance purposes. We show that, by exploiting physicians’ knowledge on symptoms positively or negatively related to a given disease, as well as the correspondence between patients’ “naïve” terminology and medical jargon, not only can we analyze large volumes of Twitter messages related to that disease, but we can also mine micro-blogs with complex queries, performing fine-grained tweets classification (e.g. those reporting influenza-like illness (ILI) symptoms vs. common cold or allergy). sions proach yields a very high level of correlation with flu trends derived from traditional surveillance systems. Compared with Google Flu, another popular tool based on query search volumes, our method is more flexible and less sensitive to changes in web search behaviors.
Keywords :
Terminology clustering , Twitter mining , Micro-blog mining , Patientיs language learning , Syndromic Surveillance
Journal title :
Artificial Intelligence In Medicine
Serial Year :
2014
Journal title :
Artificial Intelligence In Medicine
Record number :
1841749
Link To Document :
بازگشت