DocumentCode :
169589
Title :
Benchmark of Arabic morphological analyzers challenges and solutions
Author :
Jaafar, Younes ; Bouzoubaa, Karim
Author_Institution :
Mohammadia Sch. of Eng., Mohammed Vth Univ. - Adgal, Rabat, Morocco
fYear :
2014
fDate :
7-8 May 2014
Firstpage :
1
Lastpage :
6
Abstract :
Arabic Natural Language Processing (ANLP) has known an important development during the last decade. Nowadays, several ANLP tools are already developed such as morphological analyzers. These analyzers are often used in more advanced applications such as syntactic parsers, search engines, machine translation systems, etc. However, the choice of a morphological analyzer to use, among others, can be difficult for researchers if they ignore its metrics. In this article, we present the challenges of the benchmark of Arabic morphological analyzers. We present also our solution developed in Java, which allows the benchmark by returning the most common metrics, namely the accuracy, precision, f-measure and execution time. This solution has the advantage of being cross-platform, flexible and allows to be extended to cover new morphological analyzers to compare.
Keywords :
Java; natural language processing; ANLP tools; Arabic morphological analyzers; Arabic natural language processing; Java; accuracy metrics; cross-platform; execution time metrics; f-measure metrics; precision metrics; Accuracy; Benchmark testing; Gold; Java; Measurement; Standards; XML; Arabic morphological analyzers; Benchmark; SAFAR platform; Standard corpus;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems: Theories and Applications (SITA-14), 2014 9th International Conference on
Conference_Location :
Rabat
Print_ISBN :
978-1-4799-3566-6
Type :
conf
DOI :
10.1109/SITA.2014.6847312
Filename :
6847312
Link To Document :
بازگشت