• DocumentCode
    2978772
  • Title

    Chinese Microblogger character analysis using SVM

  • Author

    Dan Tian ; Wei-Min Zheng

  • Author_Institution
    Guangzhou Inst. of Adv. Technol., Guangzhou, China
  • fYear
    2012
  • fDate
    17-19 Dec. 2012
  • Firstpage
    385
  • Lastpage
    389
  • Abstract
    Microblogging provides a new platform for communicating and sharing information among Web users. Users can express opinions and record daily life using microblogs. Mircoblogs that are posted by users indicate the characters of users. In this paper, we focus on using Sina Weibo, the most popular Chinese microblogging platform, for the task of user character analysis. We define four categories features by analyzing microblogs, and show how to collect labeled corpus as training data. Using the corpus and via SVM (Support Vector Machines), we build a character classifier, which is able to determine extraversion or introversion for a microblogger. The experimental evaluations show that our method can identify users´ character accurately and efficiently.
  • Keywords
    Web sites; natural language processing; support vector machines; Chinese microblogger character analysis; SVM; Sina Weibo; Web users; character classifier; labeled corpus; support vector machines; Abstracts; Accuracy; Silicon; Character analysis; SVM; Sina Weibo;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wavelet Active Media Technology and Information Processing (ICWAMTIP), 2012 International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4673-1684-2
  • Type

    conf

  • DOI
    10.1109/ICWAMTIP.2012.6413519
  • Filename
    6413519