• DocumentCode
    3700254
  • Title

    Sentiment analysis of mixed language employing Hindi-English code switching

  • Author

    Dinkar Sitaram;Savitha Murthy;Debraj Ray;Devansh Sharma;Kashyap Dhar

  • Author_Institution
    Center for Cloud computing and Big Data (CCBD), PES University, Banashankari Stage III. Bangalore 560085
  • Volume
    1
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    271
  • Lastpage
    276
  • Abstract
    Sentiment analysis has emerged as one of the prominent research branches because of its endless usages and applications. Monitoring social media, forums, blogs and other online resources for customer reviews, product competition and survey responses to understand customer insight is of significant importance in business analytics. With the proliferation of informal user generated data online, the use of mixed language has become a common phenomenon. Mixed language arises through the use of linguistic code switching (LCS) or the practice of using more than one language in a single sentence. Such mixed language has rarely been a subject of sentiment analysis before. The lack of a clear grammatical structure renders the previous approaches to sentiment analysis ineffective for such text. In this paper, we propose a strategy to determine the sentiment of sentences written in a mixed language comprising of Hindi and English lexicons. Our technique can be used to analyze the sentiment of data belonging to any one of the source languages as well as the mixed language data. Grammatical transitions which are very common in mixed language have been taken into account during the sentiment analysis. We demonstrate the effectiveness of the proposed approach via case studies on social media data sets.
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2015 International Conference on
  • Type

    conf

  • DOI
    10.1109/ICMLC.2015.7340934
  • Filename
    7340934