• DocumentCode
    3779464
  • Title

    Text readability for Arabic as a foreign language

  • Author

    Hind Saddiki;Karim Bouzoubaa;Violetta Cavalli-Sforza

  • Author_Institution
    Department of Computer Science, Mohammadia School of Engineering, UM5-Agdal, Rabat, Morocco
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    In this study, we evaluate the informativeness of lexical, morphological and semantic features in determining the readability of texts geared towards learners of Arabic as a foreign language. We have gathered low-complexity features with the purpose of establishing a baseline for future research in readability assessment, using freely available natural language processing (NLP) and machine learning (ML) tools on a publicly accessible corpus. We tested common classification algorithms, as well as random forests-an ensemble learning method-and report on their results using several evaluation measures for comparability with similar work. Our results suggest that a small set of easily computed features can be indicative of the reading level of a text. Moreover, our findings will serve as a common ground, for ourselves and others, to evaluate and compare the performance of more elaborate techniques and feature sets.
  • Keywords
    "Training","Artificial intelligence"
  • Publisher
    ieee
  • Conference_Titel
    Computer Systems and Applications (AICCSA), 2015 IEEE/ACS 12th International Conference of
  • Electronic_ISBN
    2161-5330
  • Type

    conf

  • DOI
    10.1109/AICCSA.2015.7507232
  • Filename
    7507232