Title :
A framework for sentiment analysis on schema-based research content via lexica analysis
Author :
Goodarzi, Marjan ; Mahmoudi, Maryam Tayefeh ; Zamani, Ramin
Author_Institution :
Assessment Group, ICT Res. Inst., Tehran, Iran
Abstract :
Sentiment Analysis is one of the significant issues in the area of natural language processing, computational linguistics and text mining. It has also become a potential research area in bibliographic search and opinion mining, which is our main focus in this paper. Sentiment analysis of citations on schema-based research contents, such as scientific articles and reports, may not only makes an appropriate understanding of the issue, but also enhances the level of reliability of citations in those contents. Taking the above points into account, in this paper, a framework for sentiment analysis based on lexica analysis is proposed to determine the polarity of author´s opinion about citations in content. This framework consists of three major steps: subjectivity detection, polarity classification and strength determination of opinion. In each step, some considerations are considered to improve the capability of proposed framework. We have applied SentiWordNet, AFINN and Bingliu lexica sets for this purpose. The schema-based research contents that have been considered for this experiment is PubMed, which consists of open source medical articles of US digital library of medicine. Experimental results reveal the superiority of SentiWordNet lexica set compared to the other existing sets in sentiment analysis of PubMed articles.
Keywords :
citation analysis; digital libraries; medical computing; natural language processing; public domain software; text analysis; AFINN; Bingliu lexica sets; PubMed articles; SentiWordNet; US digital library; bibliographic search; computational linguistics; lexica analysis; natural language processing; open source medical articles; opinion mining; polarity classification; schema-based research content; sentiment analysis; strength determination; subjectivity detection; text mining; Abstracts; Context; Dictionaries; Libraries; Pragmatics; Sentiment analysis; Syntactics; Sentiment analysis; lexica set; opinion mining; opinion strength determination; polarity classification; schema-based content;
Conference_Titel :
Telecommunications (IST), 2014 7th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-5358-5
DOI :
10.1109/ISTEL.2014.7000738