Title :
A comparison of similarity measures for online social media Thai text classification
Author :
Viriyavisuthisakul, Supatta ; Sanguansat, Parinya ; Charnkeitkong, Pisit ; Haruechaiyasak, Choochart
Author_Institution :
Fac. of Eng. & Technol., Panyapiwat Inst. of Manage., Nonthaburi, Thailand
Abstract :
Social media is widely used as a channel of communication in general purposes, including the comment that are related to retail business. It is a highly effective communication tool for direct interacting with their customers. Growth rate of the users is rapidly increasing, because they use this channel to receive information and share something interesting. In this paper, we present a comparison experimental results on the similarity measurements on Thai social media text. We use the nearest neighbor classifier with the 10 distance functions to classify there data into 4 classes, which the classes were defined by the expert in retail business. The experiments were performed on two feature extractions with the well-known term weightings and compared the classification results with the experts. According to the experimental results, we found the feature extraction and the distance that can be employed with Thai social media text classification.
Keywords :
feature extraction; social networking (online); text analysis; communication tool; distance functions; feature extractions; general purposes; growth rate; nearest neighbor classifier; online social media Thai text classification; retail business; similarity measurements; term weightings; Accuracy; Business; Chebyshev approximation; Correlation; Euclidean distance; Media; Text categorization; Classification; Similarity measure; Social media; Text mining; Thai; The nearest neighbor;
Conference_Titel :
Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2015 12th International Conference on
Conference_Location :
Hua Hin
DOI :
10.1109/ECTICon.2015.7207106