Title of article :
Performance of KNN and SVM classifiers on full word Arabic articles
Author/Authors :
Ismail Hmeidi، نويسنده , , Ismail and Hawashin، نويسنده , , Bilal and El-Qawasmeh، نويسنده , , Eyas، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2008
Abstract :
This paper reports a comparative study of two machine learning methods on Arabic text categorization. Based on a collection of news articles as a training set, and another set of news articles as a testing set, we evaluated K nearest neighbor (KNN) algorithm, and support vector machines (SVM) algorithm. We used the full word features and considered the tf.idf as the weighting method for feature selection, and CHI statistics as a ranking metric. Experiments showed that both methods were of superior performance on the test corpus while SVM showed a better micro average F1 and prediction time.
Keywords :
CHI statistics , SVM , Full word features , kNN , tf.idf weighting , Arabic text categorization
Journal title :
ADVANCED ENGINEERING INFORMATICS
Journal title :
ADVANCED ENGINEERING INFORMATICS