DocumentCode
3302322
Title
Data Base Investigation as a Ranking Problem
Author
Veenman, Cor J.
Author_Institution
Digital Technol. & Biometrics Dept., Netherlands Forensic Inst., The Hague, Netherlands
fYear
2012
fDate
22-24 Aug. 2012
Firstpage
225
Lastpage
231
Abstract
When data mining for forensic investigations, we are typically confronted with strongly imbalanced classes. Moreover, the labels of the non-target or negative class are usually not confirmed. In other words, the non-positive objects are unlabeled. For these situations classification methods are not well suited. We propose to approach these problems as ranking problems. We apply several supervised learning methods, including recently developed methods that are specifically aimed at optimizing ranking performance. With a true investigation dataset, we show the improvement over the prior probabilities using the ranking approach. It turns out that some two-class classification methods perform competitively on ranking performance, while the true ranking methods do not stand out.
Keywords
computer forensics; data mining; database management systems; learning (artificial intelligence); optimisation; pattern classification; probability; data mining; database investigation; forensic investigations; negative class labels; nontarget class labels; prior probabilities; ranking performance optimization; ranking problem; supervised learning methods; two-class classification methods; unlabeled nonpositive objects; Accuracy; Equations; Mathematical model; Noise; Optimization; Sociology; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligence and Security Informatics Conference (EISIC), 2012 European
Conference_Location
Odense
Print_ISBN
978-1-4673-2358-1
Type
conf
DOI
10.1109/EISIC.2012.44
Filename
6298835
Link To Document