• DocumentCode
    538035
  • Title

    Using Self Organizing Map to cluster Arabic crime documents

  • Author

    Alruily, Meshrif ; Ayesh, Aladdin ; Al-Marghilani, Abdulsamad

  • Author_Institution
    Software Technol. Res. Lab., De Montfort Univ., Leicester, UK
  • fYear
    2010
  • fDate
    18-20 Oct. 2010
  • Firstpage
    357
  • Lastpage
    363
  • Abstract
    This paper presents a system that combines two text mining techniques; information extraction and clustering. A rule-based approach is used to perform the information extraction task, based on the dependency relation between some intransitive verbs and prepositions. This relationship helps in extracting types of crime from documents within the crime domain. With regard to the clustering task, the Self Organizing Map (SOM) is used to cluster Arabic crime documents based on crime types. This work is then validated through experiments, the results of which show that the techniques developed here are promising.
  • Keywords
    data mining; document handling; information retrieval; knowledge based systems; natural language processing; pattern clustering; self-organising feature maps; task analysis; Arabic crime document clustering; dependency relation; information clustering; information extraction task; intransitive prepositions; intransitive verbs; rule based approach; self organizing map; text mining; Context; Data mining; Data visualization; Grammar; Neurons; Organizing; Pragmatics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
  • Conference_Location
    Wisla
  • ISSN
    2157-5525
  • Print_ISBN
    978-1-4244-6432-6
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2010.5679616
  • Filename
    5679616