• DocumentCode
    615076
  • Title

    Distribution-sensitive learning for imbalanced datasets

  • Author

    Song, Yuning ; Morency, Louis-Philippe ; Davis, Ronald W.

  • fYear
    2013
  • fDate
    22-26 April 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Many real-world face and gesture datasets are by nature imbalanced across classes. Conventional statistical learning models (e.g., SVM, HMM, CRY), however, are sensitive to imbalanced datasets. In this paper we show how an imbalanced dataset affects the performance of a standard learning algorithm, and propose a distribution-sensitive prior to deal with the imbalanced data problem. This prior analyzes the training dataset before learning a model, and puts more weight on the samples from underrepresented classes, allowing all samples in the dataset to have a balanced impact in the learning process. We report on two empirical studies regarding learning with imbalanced data, using two publicly available recent gesture datasets, the Microsoft Research Cambridge-12 (MSRC-12) and NATOPS aircraft handling signals datasets. Experimental results show that learning from balanced data is important, and that the distribution-sensitive prior improves performance with imbalanced datasets.
  • Keywords
    data analysis; hidden Markov models; learning (artificial intelligence); support vector machines; CRY; HMM; MSRC-12; Microsoft Research Cambridge-12; NATOPS aircraft handling signals datasets; SVM; distribution-sensitive learning; face datasets; gesture datasets; imbalanced datasets; nature imbalanced across classes; standard learning algorithm; statistical learning models; Standards;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-5545-2
  • Electronic_ISBN
    978-1-4673-5544-5
  • Type

    conf

  • DOI
    10.1109/FG.2013.6553715
  • Filename
    6553715