Title :
Thai-English spam SMS filtering
Author :
Khemapatapan, Chaiyaporn
Author_Institution :
Comput. & Telecommun. Eng., Dhurakij Pundit Univ., Bangkok, Thailand
fDate :
Oct. 31 2010-Nov. 3 2010
Abstract :
SMS spam filtering for Thai-English language has not previously been studied and implemented. Two methods of spam SMS message filtering objected to filter spam SMS messages written in Thai and English have been studied and implemented. The first method simply uses current spam English message filtering and then upgrades for Thai language support. The second one applies text normalization, word segmentation process, and analyzing/correcting the semantic of Thai words. However, both methods are applied by 2 different decision-making algorithms: Support Vector Machine (SVM) and Naive Bayesian (NB) algorithms. Finally, the results from trial applying in the real system are shown. The results show that the second filtering method has higher accuracy. Moreover, the SVM based filtering consumes more processing time than the NB based filtering about 2.5 times for both proposed methods.
Keywords :
belief networks; electronic messaging; mobile radio; natural language processing; support vector machines; telecommunication computing; telecommunication security; text analysis; SMS spam filtering; Thai-English language; naive Bayesian algorithm; support vector machine; text normalization; word segmentation process; Accuracy; Classification algorithms; Filtering; Niobium; Semantics; Support vector machines; Unsolicited electronic mail; Naive Bayesian; Support Vector Machine; Thai-English Spam SMS Filtering;
Conference_Titel :
Communications (APCC), 2010 16th Asia-Pacific Conference on
Conference_Location :
Auckland
Print_ISBN :
978-1-4244-8128-6
Electronic_ISBN :
978-1-4244-8127-9
DOI :
10.1109/APCC.2010.5679770