DocumentCode
538035
Title
Using Self Organizing Map to cluster Arabic crime documents
Author
Alruily, Meshrif ; Ayesh, Aladdin ; Al-Marghilani, Abdulsamad
Author_Institution
Software Technol. Res. Lab., De Montfort Univ., Leicester, UK
fYear
2010
fDate
18-20 Oct. 2010
Firstpage
357
Lastpage
363
Abstract
This paper presents a system that combines two text mining techniques; information extraction and clustering. A rule-based approach is used to perform the information extraction task, based on the dependency relation between some intransitive verbs and prepositions. This relationship helps in extracting types of crime from documents within the crime domain. With regard to the clustering task, the Self Organizing Map (SOM) is used to cluster Arabic crime documents based on crime types. This work is then validated through experiments, the results of which show that the techniques developed here are promising.
Keywords
data mining; document handling; information retrieval; knowledge based systems; natural language processing; pattern clustering; self-organising feature maps; task analysis; Arabic crime document clustering; dependency relation; information clustering; information extraction task; intransitive prepositions; intransitive verbs; rule based approach; self organizing map; text mining; Context; Data mining; Data visualization; Grammar; Neurons; Organizing; Pragmatics;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
Conference_Location
Wisla
ISSN
2157-5525
Print_ISBN
978-1-4244-6432-6
Type
conf
DOI
10.1109/IMCSIT.2010.5679616
Filename
5679616
Link To Document