• DocumentCode
    1410226
  • Title

    Structural risk minimization over data-dependent hierarchies

  • Author

    Shawe-Taylor, John ; Bartlett, Peter L. ; Williamson, Robert C. ; Anthony, Martin

  • Author_Institution
    Dept. of Comput. Sci., London Univ., UK
  • Volume
    44
  • Issue
    5
  • fYear
    1998
  • fDate
    9/1/1998 12:00:00 AM
  • Firstpage
    1926
  • Lastpage
    1940
  • Abstract
    The paper introduces some generalizations of Vapnik´s (1982) method of structural risk minimization (SRM). As well as making explicit some of the details on SRM, it provides a result that allows one to trade off errors on the training sample against improved generalization performance. It then considers the more general case when the hierarchy of classes is chosen in response to the data. A result is presented on the generalization performance of classifiers with a “large margin”. This theoretically explains the impressive generalization performance of the maximal margin hyperplane algorithm of Vapnik and co-workers (which is the basis for their support vector machines). The paper concludes with a more general result in terms of “luckiness” functions, which provides a quite general way for exploiting serendipitous simplicity in observed data to obtain better prediction accuracy from small training sets. Four examples are given of such functions, including the Vapnik-Chervonenkis (1971) dimension measured on the sample
  • Keywords
    approximation theory; learning systems; minimisation; probability; risk management; set theory; Vapnik´s method; Vapnik-Chervonenkis dimension; classifiers; data-dependent hierarchies; generalization performance; luckiness functions; maximal margin hyperplane algorithm; prediction accuracy; probably approximately correct model; small training sets; structural risk minimization; support vector machines; training sample errors; Accuracy; Australia Council; Convergence; Helium; Machine learning; Risk management; Size measurement; Support vector machine classification; Support vector machines; Virtual colonoscopy;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/18.705570
  • Filename
    705570