• DocumentCode
    3756115
  • Title

    New Approaches for Extracting Arabic Keyphrases

  • Author

    Mahmoud Nabil;Amir F. Atiya;Mohamed Aly

  • Author_Institution
    Dept. of Comput. Eng., Cairo Univ., Giza, Egypt
  • fYear
    2015
  • fDate
    4/1/2015 12:00:00 AM
  • Firstpage
    133
  • Lastpage
    137
  • Abstract
    Keyphrases extraction has a considerable importance in many applications such as search engine optimization, clustering, summarization, and sentiment analysis. The importance of keyphrases comes from the semantic meaning they provide as they can be used as descriptors for the documents. In this paper we compare four approaches for extracting keyphrases from Arabic documents. The first method uses the KP-Miner keyphrase extraction system. The second method uses Arabic natural language processing tools (stemmer and part of speech tagger) in order to filter some patterns that can be weighted by token frequency inverse document frequency (TF-IDF) algorithm. The third method uses Google´sWord2Vec library to calculate the weighting of the resulting patterns by measuring the similarity of the candidate pattern and the document title. The fourth method combines the weightings result from the second and the third method.
  • Keywords
    "Feature extraction","Pragmatics","Context","Semantics","Natural language processing","Speech","Support vector machines"
  • Publisher
    ieee
  • Conference_Titel
    Arabic Computational Linguistics (ACLing), 2015 First International Conference on
  • Print_ISBN
    978-1-4673-9154-2
  • Type

    conf

  • DOI
    10.1109/ACLing.2015.26
  • Filename
    7422291