DocumentCode :
2617827
Title :
Arabic verb pattern extraction
Author :
Saad, E.M. ; Awadalla, M.H. ; Alajmi, A.
Author_Institution :
Commun. & Electron. Dept., Helwan Univ., Cairo, Egypt
fYear :
2010
fDate :
10-13 May 2010
Firstpage :
642
Lastpage :
645
Abstract :
Arabic is a highly inflected language, so stemming and root extraction remain a challenge for researchers. A new method is presented for extracting the stem and lemma of Arabic text. Stemming sometimes alters the meaning of a word, whereas a lemma preserves it. The approach is based on pattern extraction and uses a special encoding that divides letters into original and non-original letters. Codes are generated automatically for each pattern and then matched against the input text to extract the root, pattern, and lemma of each word. A comparison with other methods shows promising results, with accuracy of up to 96%.
Keywords :
computational linguistics; data handling; feature extraction; pattern recognition; Arabic text stem extraction; Arabic verb pattern extraction; inflected language; Morphological Analyzer; Natural Language Processing; Root Extraction;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information Sciences Signal Processing and their Applications (ISSPA), 2010 10th International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-7165-2
Type :
conf
DOI :
10.1109/ISSPA.2010.5605427
Filename :
5605427