Title :
Arabic verb pattern extraction
Author :
Saad, E.M. ; Awadalla, M.H. ; Alajmi, A.
Author_Institution :
Commun. & Electron. Dept., Helwan Univ., Cairo, Egypt
Abstract :
Arabic is a highly inflected language, and therefore the processes of stemming and root extracting represent a challenge to researches. A new method is presented for extracting Arabic text stem, and lemma. Stemming sometimes affects the semantic of a word, where as lemma preserve the meaning of a word. The approach is based on pattern extraction. It uses a special encoding based on dividing letters into original and non-original letters. Codes are automatically generated for each pattern and then match against input text to extract root, pattern, and lemma of a word. A comparison with other methods reveals a promising result with accuracy up to 96%.
Keywords :
computational linguistics; data handling; feature extraction; pattern recognition; Arabic text stem extraction; Arabic verb pattern extraction; inflected language; Morphological Analyzer; Natural Language Processing; Root Extraction;
Conference_Titel :
Information Sciences Signal Processing and their Applications (ISSPA), 2010 10th International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-7165-2
DOI :
10.1109/ISSPA.2010.5605427