• DocumentCode
    1358892
  • Title

    Parameter Selection for Principal Curves

  • Author

    Biau, Gérard ; Fischer, Aurélie

  • Author_Institution
    Univ. Pierre et Marie Curie-Paris VI, Paris, France
  • Volume
    58
  • Issue
    3
  • fYear
    2012
  • fDate
    3/1/2012 12:00:00 AM
  • Firstpage
    1924
  • Lastpage
    1939
  • Abstract
    Principal curves are nonlinear generalizations of the notion of first principal component. Roughly, a principal curve is a parameterized curve in which passes through the “middle” of a data cloud drawn from some unknown probability distribution. Depending on the definition, a principal curve relies on some unknown parameters (number of segments, length, turn, etc.) which have to be properly chosen to recover the shape of the data without interpolating. In this paper, we consider the principal curve problem from an empirical risk minimization perspective and address the parameter selection issue using the point of view of model selection via penalization. We offer oracle inequalities and implement the proposed approach to recover the hidden structures in both simulated and real-life data.
  • Keywords
    curve fitting; statistical distributions; data cloud; model selection; nonlinear generalization; oracle inequality; parameter selection; parameterized curve; penalization; principal curve problem; probability distribution; risk minimization; Approximation algorithms; Complexity theory; Context; Indexes; Principal component analysis; Shape; Upper bound; Model selection; oracle inequality; parameter selection; penalty calibration; principal curves; slope heuristics;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2011.2173157
  • Filename
    6058656