DocumentCode :
3772284
Title :
A Comparison of Similarity Metrics for Sentiment Analysis on Turkish Twitter Feeds
Author :
Önder Çoban; Baris Özyer and Gülsah Tümüklü Özyer
Author_Institution :
Dept. of Comput. Eng., Atatυ
fYear :
2015
Firstpage :
333
Lastpage :
338
Abstract :
Sentiment analysis is one of the most useful tools in social media monitoring. Applying sentiment analysis to social media data (blogs, Twitter, Facebook, etc.) helps measure customer satisfaction and, as a result, can reduce production costs for a company. Moreover, sentiment analysis can be used in various other domains, such as economics, commerce, and opinion mining, to collect data and extract meaningful information. In this study, our major goal is to investigate the positive/negative polarity of Turkish Twitter feeds by using text classification methods for sentiment analysis. Bag of Words and N-Gram models are used to extract the content of the text in the feature extraction phase. Different similarity metrics are analyzed to improve the performance of the kNN classifier on both the Reuters-8 and Turkish Twitter Feeds data. The Reuters-8 data are used to analyze the effect of text language and length on classification results. The experiments are conducted on six different combinations of feature extraction models and weighting methods. Experimental results show that IT-Sim gives better performance compared to other classification metrics and that Tf-Idf is the most effective weighting method. The accuracy of the kNN classifier depends on the combination of feature extraction model and weighting method and on the value of the k parameter.
Keywords :
"Feature extraction","Measurement","Sentiment analysis","Twitter","Feeds","Media","Blogs"
Publisher :
IEEE
Conference_Titel :
Smart City/SocialCom/SustainCom (SmartCity), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/SmartCity.2015.93
Filename :
7463747