DocumentCode :
3779464
Title :
Text readability for Arabic as a foreign language
Author :
Hind Saddiki;Karim Bouzoubaa;Violetta Cavalli-Sforza
Author_Institution :
Department of Computer Science, Mohammadia School of Engineering, UM5-Agdal, Rabat, Morocco
fYear :
2015
Firstpage :
1
Lastpage :
8
Abstract :
In this study, we evaluate the informativeness of lexical, morphological and semantic features in determining the readability of texts geared towards learners of Arabic as a foreign language. We have gathered low-complexity features with the purpose of establishing a baseline for future research in readability assessment, using freely available natural language processing (NLP) and machine learning (ML) tools on a publicly accessible corpus. We tested common classification algorithms, as well as random forests-an ensemble learning method-and report on their results using several evaluation measures for comparability with similar work. Our results suggest that a small set of easily computed features can be indicative of the reading level of a text. Moreover, our findings will serve as a common ground, for ourselves and others, to evaluate and compare the performance of more elaborate techniques and feature sets.
Keywords :
"Training","Artificial intelligence"
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications (AICCSA), 2015 IEEE/ACS 12th International Conference of
Electronic_ISBN :
2161-5330
Type :
conf
DOI :
10.1109/AICCSA.2015.7507232
Filename :
7507232
Link To Document :
بازگشت