• DocumentCode
    1114737
  • Title

    Affect Analysis of Web Forums and Blogs Using Correlation Ensembles

  • Author

    Abbasi, Ahmed ; Chen, Hsinchun ; Thoms, Sven ; Fu, Tianjun

  • Author_Institution
    Dept. of Manage. Inf. Syst., Arizona Univ., Tucson, AZ
  • Volume
    20
  • Issue
    9
  • fYear
    2008
  • Firstpage
    1168
  • Lastpage
    1180
  • Abstract
    Analysis of affective intensities in computer-mediated communication is important in order to allow a better understanding of online users´ emotions and preferences. Despite considerable research on textual affect classification, it is unclear which features and techniques are most effective. In this study, we compared several feature representations for affect analysis, including learned n-grams and various automatically and manually crafted affect lexicons. We also proposed the support vector regression correlation ensemble (SVRCE) method for enhanced classification of affect intensities. SVRCE uses an ensemble of classifiers each trained using a feature subset tailored toward classifying a single affect class. The ensemble is combined with affect correlation information to enable better prediction of emotive intensities. Experiments were conducted on four test beds encompassing web forums, blogs, and online stories. The results revealed that learned n-grams were more effective than lexicon-based affect representations. The findings also indicated that SVRCE outperformed comparison techniques, including Pace regression, semantic orientation, and WordNet models. Ablation testing showed that the improved performance of SVRCE was attributable to its use of feature ensembles as well as affect correlation information. A brief case study was conducted to illustrate the utility of the features and techniques for affect analysis of large archives of online discourse.
  • Keywords
    Web sites; regression analysis; support vector machines; Web blogs; Web forums; WordNet models; affect analysis; computer-mediated communication; correlation ensembles; feature representations; lexicon-based affect representations; pace regression; semantic orientation; support vector regression correlation ensemble method; textual affect classification; Discourse; Linguistic processing; Machine learning; Text mining;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.51
  • Filename
    4479460