DocumentCode :
175879
Title :
Discriminating gender on Chinese microblog: A study of online behaviour, writing style and preferred vocabulary
Author :
Li Li ; Maosong Sun ; Zhiyuan Liu
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
19-21 Aug. 2014
Firstpage :
812
Lastpage :
817
Abstract :
As user attributes are useful for applications such as personalized recommendation, adverting and so on, user attribute predication on Twitter has attracted intensive attentions in recent years. Although Chinese micro-blogging services are different from Twitter on various aspects such as language, user behaviours and so on, few efforts have been made on Chinese micro-blogging services. In this paper, we propose a gender prediction model for Chinese microblog which exploits features including online behaviour, writing style, and preferred vocabulary. Experimental results on Sina Weibo, which is one of the most popular micro-blogging services in China, show that our model achieves the state-of-the-art accuracy 94.3%. We also find significant distinctions between male and female microblog users on online behaviour, writing style and preferred vocabulary, which would be helpful for improving personalized applications.
Keywords :
Internet; Web sites; gender issues; vocabulary; Chinese microblogging services; Twitter; gender prediction model; online behaviour; personalized recommendation; preferred vocabulary; writing style; Accuracy; Fans; Feature extraction; Predictive models; Twitter; Vocabulary; Writing; Chinese microblog; gender prediction; user behaviour analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation (ICNC), 2014 10th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5150-5
Type :
conf
DOI :
10.1109/ICNC.2014.6975942
Filename :
6975942
Link To Document :
بازگشت