DocumentCode :
1791627
Title :
Analyzing the language of food on social media
Author :
Fried, Daniel ; Surdeanu, Mihai ; Kobourov, Stephen ; Hingle, Melanie ; Bell, David
Author_Institution :
Univ. of Arizona, Tucson, AZ, USA
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
778
Lastpage :
783
Abstract :
We investigate the predictive power behind the language of food on social media. We collect a corpus of over three million food-related posts from Twitter and demonstrate that many latent population characteristics can be directly predicted from this data: overweight rate, diabetes rate, political leaning, and home geographical location of authors. For all tasks, our language-based models significantly outperform the majority-class baselines. Performance is further improved with more complex natural language processing, such as topic modeling. We analyze which textual features have greatest predictive power for these datasets, providing insight into the connections between the language of food, geographic locale, and community characteristics. Lastly, we design and implement an online system for real-time query and visualization of the dataset. Visualization tools, such as geo-referenced heatmaps and temporal histograms, allow us to discover more complex, global patterns mirrored in the language of food.
Keywords :
food products; natural language processing; social networking (online); Twitter; community characteristics; complex natural language processing; dataset visualization; diabetes rate; food-related posts; geo-referenced heatmaps; geographic locale; home geographical location; language-based models; language-of-food; latent population characteristics; majority-class baselines; overweight rate; political leaning; predictive power; real-time query; social media; temporal histograms; textual features; topic modeling; visualization tools; Accuracy; Cities and towns; Context; Diabetes; Predictive models; Twitter; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004305
Filename :
7004305
Link To Document :
بازگشت