• DocumentCode
    163456
  • Title

    Towards author identification of Arabic text articles

  • Author

    Otoom, Ahmed Fawzi ; Abdullah, Emad E. ; Jaafer, Shifaa ; Hamdallh, Aseel ; Amer, Dana

  • Author_Institution
    Fac. of Prince Al-Hussein Bin Abdullah II for Inf. Technol., Hashemite Univ., Zarqa, Jordan
  • fYear
    2014
  • fDate
    1-3 April 2014
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    We target the problem of identifying the author of an Arabic text article. Our main aim is to develop an intelligent system that is capable of classifying a new article into one of seven classes that belong to seven different authors. For this purpose, we propose a novel dataset consisting of 12 features and 456 instances belonging to the 7 authors. In addition, we combine the proposed feature set with strong classification algorithms to assist in distinguishing between the different authors. Our results show that the proposed dataset has proved successful with a classification performance accuracy of 82% with the hold-out test.
  • Keywords
    classification; learning (artificial intelligence); natural language processing; Arabic text article; author identification; classification algorithm; feature set; intelligent system; Accuracy; Classification algorithms; Feature extraction; Support vector machines; Syntactics; Testing; Writing; Arabic text features; authorship identification; functional trees; support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Systems (ICICS), 2014 5th International Conference on
  • Conference_Location
    Irbid
  • Print_ISBN
    978-1-4799-3022-7
  • Type

    conf

  • DOI
    10.1109/IACS.2014.6841971
  • Filename
    6841971