• DocumentCode
    1842104
  • Title

    Cross validation and MLP architecture selection

  • Author

    Andersen, Tim ; Martinez, Tony

  • Author_Institution
    Dept. of Comput. Sci., Brigham Young Univ., Provo, UT, USA
  • Volume
    3
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    1614
  • Abstract
    The performance of cross validation (CV) based MLP architecture selection is examined using 14 real world problem domains. When testing many different network architectures the results show that CV is only slightly more likely than random to select the optimal network architecture, and that the strategy of using the simplest available network architecture performs better than CV in this case. Experimental evidence suggests several reasons for the poor performance of CV. In addition, three general strategies which lead to significant increase in the performance of CV are proposed. While this paper focuses on using CV to select the optimal MLP architecture, the strategies are also applicable when CV is used to select between several different learning models, whether the models are neural networks, decision trees, or other types of learning algorithms. When using these strategies the average generalization performance of the network architecture which CV selects is significantly better than the performance of several other well known machine learning algorithms on the data sets tested
  • Keywords
    generalisation (artificial intelligence); learning (artificial intelligence); multilayer perceptrons; neural net architecture; optimisation; CV-based MLP architecture selection; cross validation; decision trees; generalization; learning models; multilayer perceptron; neural networks; optimal network architecture selection; Bayesian methods; Computer architecture; Computer science; Decision trees; Machine learning algorithms; Multilayer perceptrons; Neural networks; Pressing; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1999. IJCNN '99. International Joint Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-5529-6
  • Type

    conf

  • DOI
    10.1109/IJCNN.1999.832613
  • Filename
    832613