• DocumentCode
    52123
  • Title

    Improving the Generalization Capacity of Cascade Classifiers

  • Author

    Ludwig, Oswaldo ; Nunes, U. ; Ribeiro, Bernardete ; Premebida, Cristiano

  • Author_Institution
    Inst. of Syst. & Robot., Univ. of Coimbra Polo II, Coimbra, Portugal
  • Volume
    43
  • Issue
    6
  • fYear
    2013
  • fDate
    Dec. 2013
  • Firstpage
    2135
  • Lastpage
    2146
  • Abstract
    The cascade classifier is a usual approach in object detection based on vision, since it successively rejects negative occurrences, e.g., background images, in a cascade structure, keeping the processing time suitable for on-the-fly applications. On the other hand, similar to other classifier ensembles, cascade classifiers are likely to have high Vapnik-Chervonenkis (VC) dimension, which may lead to overfitting the training data. Therefore, this work aims at improving the generalization capacity of the cascade classifier by controlling its complexity, which depends on the model of their classifier stages, the number of stages, and the feature space dimension of each stage, which can be controlled by integrating the parameter setting of the feature extractor (in our case an image descriptor) into the maximum-margin framework of support vector machine training, as will be shown in this paper. Moreover, to set the number of cascade stages, bounds on the false positive rate (FP) and on the true positive rate (TP) of cascade classifiers are derived based on a VC-style analysis. These bounds are applied to compose an enveloping receiver operating curve (EROC), i.e., a new curve in the TP-FP space in which each point is an ordered pair of upper bound on the FP and lower bound on the TP. The optimal number of cascade stages is forecasted by comparing EROCs of cascades with different numbers of stages.
  • Keywords
    feature extraction; generalisation (artificial intelligence); image classification; learning (artificial intelligence); support vector machines; EROC; FP; TP; VC dimension; Vapnik-Chervonenkis dimension; background images; cascade classifiers; cascade structure; classifier ensembles; classifier stages; enveloping receiver operating curve; false positive rate; feature extraction; feature space dimension; generalization capacity; maximum-margin framework; object detection; support vector machine training; true positive rate; Biological cells; Complexity theory; Feature extraction; Object detection; Support vector machines; Training; Training data; Cascade classifier; maximal margin principle; pattern recognition; pedestrian detection; statistical learning;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2013.2240678
  • Filename
    6459567