DocumentCode :
3756103
Title :
A Named Entities Recognition System for Modern Standard Arabic using Rule-Based Approach
Author :
Hala Elsayed;Tarek Elghazaly
Author_Institution :
Comput. &
fYear :
2015
fDate :
4/1/2015 12:00:00 AM
Firstpage :
51
Lastpage :
54
Abstract :
Named Entity Recognition (NER) is a task in Information Extraction (IE). The Named Entity Recognition has become very important for Natural Language Processing (NLP). In this paper, we designed a system which enhanced the named entities recognition for Arabic language where the system was developed for Arabic nouns and entities extractions. The nouns extraction system is based on Arabic morphological, the Arabic grammar rules a lot of them are not used before. The noun extraction in the system uses no gazetteers and the system is combined with entities extraction system depending on gazetteers. The system extracts noun according to morphological Arabic and classify them into proper nouns entities, title entities, currency entities, percentage entities, countries entities, cities entities, nationality entities, number entities, places entities, date entities and time entities. The system applied algorithms for generate nationality entities from countries entities, and the system applied Regular Expression (RE) for extract numbers in digit format. The system is not needed to normalization into the text before extraction process. The system tested text that is in the Modern Standard Arabic (MSA), the corpus is in open text. The system achieves results in an average recall of 85%.
Keywords :
"Grammar","Logic gates","Algorithm design and analysis","Standards","Information retrieval","Cities and towns","Morphology"
Publisher :
ieee
Conference_Titel :
Arabic Computational Linguistics (ACLing), 2015 First International Conference on
Print_ISBN :
978-1-4673-9154-2
Type :
conf
DOI :
10.1109/ACLing.2015.14
Filename :
7422279
Link To Document :
بازگشت