DocumentCode :
3157176
Title :
Twitter vs. printed English: An information-theoretic comparison
Author :
Glennon, Emma ; Sankar, Lalitha ; Poor, H. Vincent
Author_Institution :
Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
3069
Lastpage :
3072
Abstract :
The popular social networking and microblogging service Twitter contains language that is very different from what is considered proper. This paper quantifies those linguistic differences between printed English and Tweetspeak using information-theoretic concepts. Letter-based n-gram entropies are calculated and compared to analagous data from two corpora of printed English to demonstrate that 1) Twitter´s entropy is overall higher than that of printed English, and 2) individual users´ entropies are on average higher the less conventional their language use is. The implications for digitally-mediated communication in general are also discussed.
Keywords :
computer mediated communication; entropy; linguistics; social networking (online); Tweetspeak; Twitter; digitally-mediated communication; information-theoretic comparison; letter-based n-gram entropies; linguistic differences; microblogging service; printed English; social networking service; Educational institutions; Entropy; Handicapped aids; Radio access networks; Redundancy; Standards; Twitter; Twitter; computer mediated communication; information entropy; information theory; redundancy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288563
Filename :
6288563
Link To Document :
بازگشت