Title :
Geo-linguistic fingerprint and the evolution of languages in twitter
Author :
Altman, Eitan ; Portilla, Yonathan
Author_Institution :
INRIA, Sophia-Antipolis, France
Abstract :
Having access to content of messages sent by some given group of subscribers of a social network may be used to identify (and quantify) some features of that group. The feature can stand for the level of interest in some event or product, or for the popularity of some idea, or a musical hit or of a political figure. The feature can also stand for the way the written language is used and transformed, the way words are spelled and the way new grammatical rules appear. This paper has two goals. First, we identify features of groups of subscribers that have their geographic location and their language in common. We develop a methodology that allows one to perform such a study using freely available statistical tools which makes use of a part of all tweets which Twitter makes available for free over the Internet. The methodology is based on the fact that one can differentiate among some geographic areas according to the activity pattern of tweets during the time of the day. The second objective is to present our findings on the way spelling and new words have are used in Twitter. We analyze differences in appearance of new spellings among communities that are characterized by different locations but have a common language.
Keywords :
Internet; linguistics; natural language processing; social networking (online); statistical analysis; Internet; Twitter; freely available statistical tool; geolinguistic fingerprint; language evolution; message content; social network; Context; Google; Internet; Media; Standards; Twitter;
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2012 IEEE/ACM International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4673-2497-7
DOI :
10.1109/asonam.2012.6570664