• DocumentCode
    3208512
  • Title

    Automated dictionary construction from Arabic corpus for meaningful crime information extraction and document classification

  • Author

    Alruily, Meshrif ; Ayesh, Aladdin ; Zedan, Hussein

  • Author_Institution
    Software Technol. Res. Lab., De Montfort Univ., Leicester, UK
  • fYear
    2010
  • fDate
    8-10 Oct. 2010
  • Firstpage
    137
  • Lastpage
    142
  • Abstract
    Arabic is a very widely spoken language, but very few mining tools have been developed to exploit the data that lies within bodies of Arabic text. Thus, this paper presents and then uses three automatic techniques specifically designed for Arabic. The target domain is crime profiling and the methods involve adaptive dictionary building. The bodies of text are mined for crime type, location and nationality. This work is then validated through three experiments, the results of which show that the techniques developed here are promising.
  • Keywords
    data mining; dictionaries; natural language processing; pattern classification; text analysis; Arabic corpus; automated dictionary construction; crime information extraction; data mining tool; document classification; text mining; Buildings; Computers; Context; Data mining; Dictionaries; Information systems; Pragmatics;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Information Systems and Industrial Management Applications (CISIM), 2010 International Conference on
  • Conference_Location
    Krakow
  • Print_ISBN
    978-1-4244-7817-0
  • Type

    conf

  • DOI
    10.1109/CISIM.2010.5643676
  • Filename
    5643676