DocumentCode :
3208512
Title :
Automated dictionary construction from Arabic corpus for meaningful crime information extraction and document classification
Author :
Alruily, Meshrif ; Ayesh, Aladdin ; Zedan, Hussein
Author_Institution :
Software Technol. Res. Lab., De Montfort Univ., Leicester, UK
fYear :
2010
fDate :
8-10 Oct. 2010
Firstpage :
137
Lastpage :
142
Abstract :
Arabic is very widely spoken, yet very few mining tools have been developed to exploit the data held in bodies of Arabic text. This paper therefore presents and applies three automatic techniques designed specifically for Arabic. The target domain is crime profiling, and the methods involve adaptive dictionary building. The texts are mined for crime type, location and nationality. The work is validated through three experiments, whose results show that the techniques are promising.
Keywords :
data mining; dictionaries; natural language processing; pattern classification; text analysis; Arabic corpus; automated dictionary construction; crime information extraction; data mining tool; document classification; text mining; Buildings; Computers; Context; Data mining; Dictionaries; Information systems; Pragmatics;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Information Systems and Industrial Management Applications (CISIM), 2010 International Conference on
Conference_Location :
Krakow
Print_ISBN :
978-1-4244-7817-0
Type :
conf
DOI :
10.1109/CISIM.2010.5643676
Filename :
5643676