• DocumentCode
    234361
  • Title

    Mixed method for extraction of domain terminology from text: Linguistic and statistical filtering

  • Author

    Lamrani, El Khadir ; Ben Lahmar, El Habib ; Marzak, Abdelaziz ; Ballaoui, Hammad

  • Author_Institution
    Lab. de Technol. de l´Inf. et Modelisation, Univ. Hassan II - Mohammedia, Casablanca, Morocco
  • fYear
    2014
  • fDate
    20-22 Oct. 2014
  • Firstpage
    291
  • Lastpage
    295
  • Abstract
    Extraction of identifier terminology from a specific domain is an indispensable task in extracting information from text, In this work we propose a hybrid method of extracting complex terms from Arabic texts which combines between linguistic and statistical approach, which focuses on a linguistic and morph syntactic analysis of the Arabic language deep to introduce an linguistic filtering algorithm of complex terms.
  • Keywords
    computational linguistics; information filtering; natural language processing; text analysis; Arabic language; Arabic texts; domain terminology; identifier terminology; information extraction; linguistic; statistical filtering; Data mining; Decision support systems; Filtering; Filtering algorithms; Pragmatics; Syntactics; Terminology; extraction of terminology; extraction of the information; linguistic analysis; linguistic filter; morph syntactic analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
  • Conference_Location
    Tetouan
  • Print_ISBN
    978-1-4799-5978-5
  • Type

    conf

  • DOI
    10.1109/CIST.2014.7016634
  • Filename
    7016634