Title :
Chinese Microblogger character analysis using SVM
Author :
Dan Tian ; Wei-Min Zheng
Author_Institution :
Guangzhou Inst. of Adv. Technol., Guangzhou, China
Abstract :
Microblogging provides a new platform for communicating and sharing information among Web users. Users can express opinions and record their daily lives using microblogs. The microblogs that users post reflect their character. In this paper, we focus on using Sina Weibo, the most popular Chinese microblogging platform, for the task of user character analysis. We define four categories of features by analyzing microblogs, and show how to collect a labeled corpus as training data. Using the corpus and an SVM (support vector machine), we build a character classifier that determines whether a microblogger is extraverted or introverted. The experimental evaluations show that our method can identify users' character accurately and efficiently.
Keywords :
Web sites; natural language processing; support vector machines; Chinese microblogger character analysis; SVM; Sina Weibo; Web users; character classifier; labeled corpus; Abstracts; Accuracy; Silicon; Character analysis
Conference_Title :
Wavelet Active Media Technology and Information Processing (ICWAMTIP), 2012 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4673-1684-2
DOI :
10.1109/ICWAMTIP.2012.6413519