• DocumentCode
    679954
  • Title

    A term weighting method for identifying emotions from text content

  • Author

    De Silva, Jenomi ; Haddela, P.S.

  • Author_Institution
    Dept. of Inf. Technol., Sri Lanka Inst. of Inf. Technol., Malabe, Sri Lanka
  • fYear
    2013
  • fDate
    17-20 Dec. 2013
  • Firstpage
    381
  • Lastpage
    386
  • Abstract
    Since the inception of the concept of social networking, communication patterns have shifted drastically with the unmitigated trend in socializing over the Internet, especially when people began connecting via mobile devices. Nowadays people tend to use these modern communication systems to share their emotions with each other. Human emotions play a vital role in human relationships and people share their emotions through facial expressions, gestures, speech and text messages. However, text messaging is the most common and widely accepted method to exchange information among peers through the Internet and mobile networks. In comparison to other methods, identifying emotions from text messages is rather difficult for the recipient. Therefore, the need of automating the emotion recognition from textual content has increased. Utilization of text classification techniques can be considered as the most common approach of identifying emotions from textual content. Prior to applying a text classifier, the textual data should be transformed into a data structure that the classifier understands by conforming to a document representation model and term weighting method. For this research Vector Space Model (VSM) is used as the document representation model. This paper proposes an extension to the Term Frequency - Inverse Document Frequency (TF-IDF) weighting method to increase classification accuracy and explains experiments conducted to discover the best term weighting method in vector space to be used in feature (text term) extraction from Aman´s emotion text corpus. The text classification is done using Oracle´s ODM SVM tool and LibSVM tool.
  • Keywords
    Internet; data structures; emotion recognition; mobile computing; pattern classification; social networking (online); support vector machines; text analysis; Aman emotion text corpus; Internet; LibSVM tool; Oracle ODM SVM tool; TF-IDF; VSM; classification accuracy; communication patterns; data structure; emotion identification; facial expressions; gestures; human relationships; mobile devices; modern communication systems; social networking; speech; term frequency-inverse document frequency; term weighting method; text classification techniques; text content; text messages; vector space model; Accuracy; Classification algorithms; Emotion recognition; Feature extraction; Support vector machine classification; Text categorization; Support Vector Machine; Term weighting; Text Classification; Vector Space Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial and Information Systems (ICIIS), 2013 8th IEEE International Conference on
  • Conference_Location
    Peradeniya
  • Print_ISBN
    978-1-4799-0908-7
  • Type

    conf

  • DOI
    10.1109/ICIInfS.2013.6732014
  • Filename
    6732014