Title :
Benchmark of Arabic morphological analyzers: challenges and solutions
Author :
Jaafar, Younes ; Bouzoubaa, Karim
Author_Institution :
Mohammadia Sch. of Eng., Mohammed Vth Univ. - Agdal, Rabat, Morocco
Abstract :
Arabic Natural Language Processing (ANLP) has seen significant development over the last decade. Several ANLP tools, such as morphological analyzers, are now available. These analyzers are often used in more advanced applications such as syntactic parsers, search engines, and machine translation systems. However, choosing one morphological analyzer among the many available can be difficult for researchers who do not know the analyzers' metrics. In this article, we present the challenges of benchmarking Arabic morphological analyzers. We also present our solution, developed in Java, which performs the benchmark and returns the most common metrics, namely accuracy, precision, f-measure and execution time. This solution is cross-platform and flexible, and it can be extended to cover new morphological analyzers.
Keywords :
Java; natural language processing; ANLP tools; Arabic morphological analyzers; Arabic natural language processing; accuracy metrics; cross-platform; execution time metrics; f-measure metrics; precision metrics; Accuracy; Benchmark testing; Gold; Measurement; Standards; XML; Benchmark; SAFAR platform; Standard corpus
Conference_Title :
Intelligent Systems: Theories and Applications (SITA-14), 2014 9th International Conference on
Conference_Location :
Rabat
Print_ISBN :
978-1-4799-3566-6
DOI :
10.1109/SITA.2014.6847312