Title :
Automated dictionary construction from Arabic corpus for meaningful crime information extraction and document classification
Author :
Alruily, Meshrif ; Ayesh, Aladdin ; Zedan, Hussein
Author_Institution :
Software Technol. Res. Lab., De Montfort Univ., Leicester, UK
Abstract :
Arabic is a very widely spoken language, but very few mining tools have been developed to exploit the data that lie within bodies of Arabic text. This paper therefore presents and applies three automatic techniques designed specifically for Arabic. The target domain is crime profiling, and the methods involve adaptive dictionary building. The bodies of text are mined for crime type, location and nationality. The work is validated through three experiments, whose results suggest that the techniques developed here are promising.
Keywords :
data mining; dictionaries; natural language processing; pattern classification; text analysis; Arabic corpus; automated dictionary construction; crime information extraction; data mining tool; document classification; text mining; Buildings; Computers; Context; Data mining; Dictionaries; Information systems; Pragmatics
Conference_Title :
Computer Information Systems and Industrial Management Applications (CISIM), 2010 International Conference on
Conference_Location :
Krakow
Print_ISBN :
978-1-4244-7817-0
DOI :
10.1109/CISIM.2010.5643676