DocumentCode :
3582751
Title :
Chunking Arabic texts using Conditional Random Fields
Author :
Khoufi, Nabil ; Aloulou, Chafik ; Belguith, Lamia Hadrich
Author_Institution :
ANLP Res. Group, Univ. of Sfax Sfax, Sfax, Tunisia
fYear :
2014
Firstpage :
428
Lastpage :
432
Abstract :
Chunking or shallow syntactic parsing is proving to be a task of interest to many natural language processing applications. The problem gets worse for the Arabic language because of its specific features that make it quite different and even more ambiguous than other natural languages when processed. In this paper, we present a method for chunking Arabic texts based on supervised learning. We use the Conditional Random Fields algorithm and the Penn Arabic Treebank to train the model. For the experimentation, we use over than 10,100 sentences as training data and 2,524 sentences for the test. The evaluation of the method consists of the calculation of the generated model accuracy and the results are very encouraging.
Keywords :
learning (artificial intelligence); natural language processing; text analysis; Arabic language; Arabic text chunking; Penn Arabic Treebank; conditional random fields; natural language processing applications; sentence evaluation; shallow syntactic parsing; supervised learning; training data; Accuracy; Context; Grammar; Natural language processing; Supervised learning; Syntactics; Training; Arabic language; CRF; Chunking; supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications (AICCSA), 2014 IEEE/ACS 11th International Conference on
Type :
conf
DOI :
10.1109/AICCSA.2014.7073230
Filename :
7073230
Link To Document :
بازگشت