Title :
An evaluation of writeprint matching method to identify the authors of Thai online messages
Author :
Marukatat, Rangsipan ; Khongrod, Siravich
Author_Institution :
Dept. of Comput. Eng., Mahidol Univ., Nakhon Pathom, Thailand
Abstract :
This research studies author identification for Thai online messages, based on 54 writing attributes. The method in focus is writeprint matching, which employs frequent pattern mining to create a writeprint for each suspect and computes a similarity score between this writeprint and the patterns found in an anonymous message. It achieved an average accuracy of 82%, while two other well-known methods, support vector machine (SVM) and C4.5 decision tree, achieved average accuracies of 89% and 81%, respectively. For the identification of individual authors, all three methods performed comparably in most cases. The writeprint matching method showed potential for reducing Type I error, that is, the chance of dismissing real offenders. However, its performance was still limited when the suspects' writing styles were too similar.
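The writeprint matching idea described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the attribute names, the brute-force miner, the `min_support` and `max_size` parameters, and in particular the similarity score (the share of a suspect's pattern support that also appears in the anonymous message). The paper's 54 attributes and its actual scoring formula are not reproduced.

```python
from collections import Counter
from itertools import combinations


def mine_frequent_patterns(messages, min_support=0.5, max_size=2):
    """Return {pattern: support} for itemsets found in at least
    min_support of the messages (brute-force, small data only)."""
    counts = Counter()
    for items in messages:
        for size in range(1, max_size + 1):
            for pattern in combinations(sorted(items), size):
                counts[frozenset(pattern)] += 1
    n = len(messages)
    return {p: c / n for p, c in counts.items() if c / n >= min_support}


def similarity(writeprint, message_items):
    """Assumed score: support of the writeprint patterns contained in
    the message, divided by the total support of the writeprint."""
    if not writeprint:
        return 0.0
    matched = sum(s for p, s in writeprint.items() if p <= message_items)
    return matched / sum(writeprint.values())


def identify(writeprints, message_items):
    """Pick the suspect whose writeprint scores highest."""
    return max(writeprints,
               key=lambda name: similarity(writeprints[name], message_items))


# Hypothetical discretized writing attributes, one set per message.
training = {
    "A": [{"short_words", "many_emoticons"},
          {"short_words", "many_emoticons", "few_particles"},
          {"short_words", "few_particles"}],
    "B": [{"long_words", "few_emoticons"},
          {"long_words", "few_emoticons", "few_particles"},
          {"long_words", "many_emoticons"}],
}
writeprints = {name: mine_frequent_patterns(msgs)
               for name, msgs in training.items()}

anonymous = {"short_words", "many_emoticons"}
print(identify(writeprints, anonymous))
```

On this toy data the anonymous message shares several frequent patterns with suspect A and none with suspect B. When two suspects have near-identical frequent patterns, their scores converge, which matches the limitation noted in the abstract.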
Keywords :
Internet; decision trees; image matching; string matching; support vector machines; C4.5 decision tree; SVM; Thai online messages; author identification; frequent pattern mining; support vector machine; writeprint matching method; writing attributes; Accuracy; Data mining; Decision trees; Itemsets; Pattern matching; Support vector machines; Writing; online messages; writeprint
Conference_Title :
Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2015 16th IEEE/ACIS International Conference on
Conference_Location :
Takamatsu
DOI :
10.1109/SNPD.2015.7176200