DocumentCode :
2774404
Title :
Predicting the Political Alignment of Twitter Users
Author :
Conover, Michael D. ; Gonçalves, Bruno ; Ratkiewicz, Jacob ; Flammini, Alessandro ; Menczer, Filippo
Author_Institution :
Center for Complex Networks & Syst. Res., Indiana Univ., Bloomington, IN, USA
fYear :
2011
fDate :
9-11 Oct. 2011
Firstpage :
192
Lastpage :
199
Abstract :
The widespread adoption of social media for political communication creates unprecedented opportunities to monitor the opinions of large numbers of politically active individuals in real time. However, without a way to distinguish between users of opposing political alignments, conflicting signals at the individual level may, in the aggregate, obscure partisan differences in opinion that are important to political strategy. In this article we describe several methods for predicting the political alignment of Twitter users based on the content and structure of their political communication in the run-up to the 2010 U.S. midterm elections. Using a data set of 1,000 manually-annotated individuals, we find that a support vector machine (SVM) trained on hash tag metadata outperforms an SVM trained on the full text of users´ tweets, yielding predictions of political affiliations with 91% accuracy. Applying latent semantic analysis to the content of users´ tweets we identify hidden structure in the data strongly associated with political affiliation, but do not find that topic detection improves prediction performance. All of these content-based methods are outperformed by a classifier based on the segregated community structure of political information diffusion networks (95% accuracy). We conclude with a practical application of this machinery to web-based political advertising, and outline several approaches to public opinion monitoring based on the techniques developed herein.
Keywords :
Internet; politics; social networking (online); support vector machines; Twitter users; Web-based political advertising; content-based method; hash tag metadata; latent semantic analysis; opinions monitoring; political affiliation; political alignment prediction; political communication; political information diffusion network; political strategy; public opinion monitoring; social media; support vector machine training; yielding prediction; Accuracy; Data mining; Real time systems; Semantics; Support vector machines; Twitter; Vectors; data mining; machine leaning; networks; polarization; political science; social media; text mining; twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4577-1931-8
Type :
conf
DOI :
10.1109/PASSAT/SocialCom.2011.34
Filename :
6113114
Link To Document :
بازگشت