DocumentCode :
243519
Title :
Applying Cost-Sensitive Classification for Financial Fraud Detection under High Class-Imbalance
Author :
Moepya, Stephen O. ; Akhoury, Sharat S. ; Nelwamondo, Fulufhelo V.
Author_Institution :
Dept. of Electr. & Electron. Eng. Sci., Univ. of Johannesburg, Johannesburg, South Africa
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
183
Lastpage :
192
Abstract :
In recent years, data mining techniques have been used to identify companies who issue fraudulent financial statements. However, most of the research conducted thus far use datasets that are balanced. This does not always represent reality, especially in fraud applications. In this paper, we demonstrate the effectiveness of cost-sensitive classifiers to detect financial statement fraud using South African market data. The study also shows how different levels of cost affect overall accuracy, sensitivity, specificity, recall and precision using PCA and Factor Analysis. Weighted Support Vector Machines (SVM) were shown superior to the cost-sensitive Naive Bayes (NB) and K-Nearest Neighbors classifiers.
Keywords :
Bayes methods; data mining; financial data processing; pattern classification; support vector machines; NB classifier; PCA; SVM; South African market data; cost-sensitive classification; data mining techniques; factor analysis; financial fraud detection; financial statement fraud detection; fraudulent financial statements; high class-imbalance; k-nearest neighbors classifier; naive Bayes classifier; weighted support vector machines; Companies; Data mining; Kernel; Mathematical model; Niobium; Principal component analysis; Support vector machines; cost-sensitive classification; data mining; financial statement fraud; high class-imbalance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.141
Filename :
7022596
Link To Document :
بازگشت