Title :
A Graph-Based Method for Detecting Rare Events: Identifying Pathologic Cells
Author :
Szekely, Eniko ; Sallaberry, Arnaud ; Zaidi, Faraz ; Poncelet, Pascal
Abstract :
Detection of outliers and anomalous behavior is a well-known problem in the data mining and statistics fields. Although the problem of identifying single outliers has been extensively studied in the literature, little effort has been devoted to detecting small groups of outliers that are similar to each other but markedly different from the entire population. Many real-world scenarios have small groups of outliers--for example, a group of students who excel in a classroom or a group of spammers in an online social network. In this article, the authors propose a novel method to solve this challenging problem that lies at the frontiers of outlier detection and clustering of similar groups. The method transforms a multidimensional dataset into a graph, applies a network metric to detect clusters, and renders a representation for visual assessment to find rare events. The authors tested the proposed method to detect pathologic cells in the biomedical science domain. The results are promising and confirm the available ground truth provided by the domain experts.
Keywords :
biology computing; cellular biophysics; data mining; graph theory; medical computing; pattern clustering; statistical analysis; anomalous behavior detection; biomedical science domain; cluster detection; data mining; graph-based method; multidimensional dataset; network metric; outlier detection; pathologic cell identification; rare event detection; statistics fields; visual assessment; Biological cells; Biomedical image processing; Cells (biology); Computer graphics; Data mining; Data visulaization; Nuclear magnetic resonance; Pathological processes; clustering; computer graphics; group of outliers; outlier detection; pathologic cells; rare events; visualization;
Journal_Title :
Computer Graphics and Applications, IEEE
DOI :
10.1109/MCG.2014.78