Title :
Semi-supervised outlier detection with only positive and unlabeled data based on fuzzy clustering
Author :
Daneshpazhouh, Armin ; Sami, Ashkan
Author_Institution :
Sch. of Electr. & Comput. Eng., Shiraz Univ., Shiraz, Iran
Abstract :
The task of semi-supervised outlier detection is to find the instances that are exceptional from other data with the use of some labeled examples. In many applications such as fraud detection and intrusion detection, this issue becomes important. Most existing techniques are unsupervised and the semi-supervised approaches use both negative and positive instances to detect outliers. However, in many real-world applications, very few positive labeled examples are available. This paper proposes an effective method to address this problem. This method is based on two steps. First, extracting reliable negative instances by KNN technique and then using fuzzy clustering with both negative and positive examples for outlier detection. Experimental results on real datasets demonstrate that the proposed method outperforms the previous methods in detecting outliers.
Keywords :
fuzzy set theory; pattern clustering; security of data; unsupervised learning; KNN technique; fraud detection; fuzzy clustering; intrusion detection; reliable negative instance extraction; semisupervised learning approach; semisupervised outlier detection approach; unsupervised learning approach; Algorithm design and analysis; Cancer; Classification algorithms; Data mining; Knowledge discovery; Reliability; Training; dataminig; outlier detection; semi-supervised learning;
Conference_Titel :
Information and Knowledge Technology (IKT), 2013 5th Conference on
Conference_Location :
Shiraz
Print_ISBN :
978-1-4673-6489-8
DOI :
10.1109/IKT.2013.6620091