DocumentCode
3208512
Title
Automated dictionary construction from Arabic corpus for meaningful crime information extraction and document classification
Author
Alruily, Meshrif ; Ayesh, Aladdin ; Zedan, Hussein
Author_Institution
Software Technol. Res. Lab., De Montfort Univ., Leicester, UK
fYear
2010
fDate
8-10 Oct. 2010
Firstpage
137
Lastpage
142
Abstract
Arabic is a very widely spoken language but very few mining tools have been developed to exploit the data that lies within bodies of Arabic text. Thus, this paper presents and then uses three automatic algorithm techniques specifically designed for Arabic. The target domain is crime profiling and the methods involve adaptive dictionary building. The bodies of text are mined for crime type, location and nationality. This work is then validated through three experiments, the results of which show that the techniques developed here are promising.
Keywords
data mining; dictionaries; natural language processing; pattern classification; text analysis; Arabic corpus; automated dictionary construction; crime information extraction; data mining tool; document classification; text mining; Buildings; Computers; Context; Data mining; Dictionaries; Information systems; Pragmatics;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Information Systems and Industrial Management Applications (CISIM), 2010 International Conference on
Conference_Location
Krackow
Print_ISBN
978-1-4244-7817-0
Type
conf
DOI
10.1109/CISIM.2010.5643676
Filename
5643676
Link To Document