DocumentCode
234361
Title
Mixed method for extraction of domain terminology from text: Linguistic and statistical filtering
Author
Lamrani, El Khadir ; Ben Lahmar, El Habib ; Marzak, Abdelaziz ; Ballaoui, Hammad
Author_Institution
Lab. de Technol. de l´Inf. et Modelisation, Univ. Hassan II - Mohammedia, Casablanca, Morocco
fYear
2014
fDate
20-22 Oct. 2014
Firstpage
291
Lastpage
295
Abstract
Extraction of identifier terminology from a specific domain is an indispensable task in extracting information from text, In this work we propose a hybrid method of extracting complex terms from Arabic texts which combines between linguistic and statistical approach, which focuses on a linguistic and morph syntactic analysis of the Arabic language deep to introduce an linguistic filtering algorithm of complex terms.
Keywords
computational linguistics; information filtering; natural language processing; text analysis; Arabic language; Arabic texts; domain terminology; identifier terminology; information extraction; linguistic; statistical filtering; Data mining; Decision support systems; Filtering; Filtering algorithms; Pragmatics; Syntactics; Terminology; extraction of terminology; extraction of the information; linguistic analysis; linguistic filter; morph syntactic analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
Conference_Location
Tetouan
Print_ISBN
978-1-4799-5978-5
Type
conf
DOI
10.1109/CIST.2014.7016634
Filename
7016634
Link To Document