DocumentCode :
2895770
Title :
Hybrid classification for tweets related to infection with influenza
Author :
Xiangfeng Dai ; Bikdash, Marwan
Author_Institution :
Dept. of Comput. Sci. & Eng., North Carolina A&T State Univ., Greensboro, NC, USA
fYear :
2015
fDate :
9-12 April 2015
Firstpage :
1
Lastpage :
5
Abstract :
Traditional public health surveillance methods such as those employed by the CDC (United States Centers for Disease Control and Prevention) rely on regular clinical reports, which are almost always manual and labor intensive. Twitter, a popular micro-blogging service, provides the possibility of automated public health surveillance. Tweets, however, are less than 140 characters, and do not provide sufficient word occurrences for conventional classification methods to work reliably. Moreover, natural language is complex. This makes health-related classification more challenging. In this study, we use flu-related classification as a demonstration to propose a hybrid classification method, which combines two classification approaches: manually- defined features and auto-generated features by machine learning approaches. Preprocessing based on Natural Language Processing (NLP) is used to help extract useful information, and to eliminate noise features. Our simulations show an improved accuracy.
Keywords :
health care; learning (artificial intelligence); medical computing; natural language processing; pattern classification; social networking (online); CDC; NLP; Twitter; United States Centers for Disease Control and Prevention; auto-generated features classification; flu-related classification; health-related classification; hybrid classification method; hybrid tweet classification; influenza infection; information extraction; machine learning approach; manually-defined features classification; natural language processing; noise feature elimination; public health surveillance methods; Influenza; Manuals; Natural language processing; Public healthcare; Surveillance; Training; Twitter; Big data; Classification; Machine learning; Natural Language Processing; Public Health; Social Network; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SoutheastCon 2015
Conference_Location :
Fort Lauderdale, FL
Type :
conf
DOI :
10.1109/SECON.2015.7133015
Filename :
7133015
Link To Document :
بازگشت