DocumentCode
708162
Title
Scalable multi-label Arabic text classification
Author
Ahmed, Nizar A. ; Shehab, Mohammed A. ; Al-Ayyoub, Mahmoud ; Hmeidi, Ismail
Author_Institution
Jordan Univ. of Sci. & Technol., Irbid, Jordan
fYear
2015
fDate
7-9 April 2015
Firstpage
212
Lastpage
217
Abstract
Multi-label text classification (MTC) is a natural extension of the traditional text classification (TC) in which a possibly large set of labels can be assigned to each document. The dimensionality of labels makes MTC difficult and challenging. Several ways are proposed to ease the classification process and one of them is called the problem transformation (PT) method. It is used to transform the multi-labeled data into a single-label one that is suitable for normal classification. Our paper presents a detailed study about using the supervised approach to address the MTC problem for Arabic text. Moreover, the scalability of such an approach is considered in our experiments. The MEKA system is used to convert the multi-label data into a single-label one using different PT methods: LC, BR and RT. Then, different classifiers commonly used for TC such as SVM, NB, KNN, and Decision Tree, are applied to each dataset. The results show that using SVM on the LC dataset generated the best results with 71% ML-accuracy.
Keywords
decision trees; pattern classification; support vector machines; text analysis; KNN classification; LC dataset; MEKA system; MTC; NB classification; PT method; SVM classification; data classification; decision tree classification; k-nearest neighbor classification; label dimensionality; multilabel Arabic text classification; naive Bayes classification; problem transformation method; support vector machines; Accuracy; Loss measurement; Niobium; Scalability; Support vector machines; Training; Exact match; Hamming loss; MEKA; Multi-label classification; Problem transformation methods; Scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Systems (ICICS), 2015 6th International Conference on
Conference_Location
Amman
Type
conf
DOI
10.1109/IACS.2015.7103229
Filename
7103229
Link To Document