• DocumentCode
    2207723
  • Title

    Rare Category Characterization

  • Author

    He, Jingrui ; Tong, Hanghang ; Carbonell, Jaime

  • Author_Institution
    Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2010
  • fDate
    13-17 Dec. 2010
  • Firstpage
    226
  • Lastpage
    235
  • Abstract
    Rare categories abound and their characterization has heretofore received little attention. Fraudulent banking transactions, network intrusions, and rare diseases are examples of rare classes whose detection and characterization are of high value. However, accurate characterization is challenging due to high-skewness and non-separability from majority classes, e.g., fraudulent transactions masquerade as legitimate ones. This paper proposes the RACH algorithm by exploring the compactness property of the rare categories. It is based on an optimization framework which encloses the rare examples by a minimum-radius hyper ball. The framework is then converted into a convex optimization problem, which is in turn effectively solved in its dual form by the projected sub gradient method. RACH can be naturally kernelized. Experimental results validate the effectiveness of RACH.
  • Keywords
    data handling; optimisation; RACH algorithm; convex optimization problem; fraudulent banking transactions; minimum-radius hyperball; network intrusions; rare category characterization; subgradient method; characterization; compactness; hyperball; minority class; optimization; rare category; subgradient;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2010 IEEE 10th International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-9131-5
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2010.154
  • Filename
    5693976