• DocumentCode
    1508225
  • Title

    Preparng the right data diet for training neural networks

  • Author

    Yale, Karla

  • Author_Institution
    Yale Systems Inc., Columbus, IN, USA
  • Volume
    34
  • Issue
    3
  • fYear
    1997
  • fDate
    3/1/1997 12:00:00 AM
  • Firstpage
    64
  • Lastpage
    66
  • Abstract
    Neural networks are a good way to interrelate nonlinear variables in a robust manner. The simplex method for optimization is not nearly as effectual, and neither are the various statistical methods for classifying and associating data and predicting results. The reason is that neural networks are put through a training phase, during which they can automatically fine-tune themselves as often as proves necessary to get the desired performance. Of course, the old adage “garbage in...garbage out” applies as much to neural networks as it does to all other data-processing applications. If the training data set (the collection of input data and its associated correct output data) is not thoughtfully chosen, the resulting network is unlikely to hold up well in an industrial environment. So it is hardly surprising that massaging the set of training data consumes some 80 percent of the engineering time spent getting a real-world neural network up and running-that is, getting it to converge under a broad enough range of conditions to be deployed with confidence in a production situation. If that data preparation is done systematically, much time can be saved and a more robust end-product can be obtained. A nine-step process is given that experience (the author´s) has shown can enhance the probability of obtaining a learning convergence robust enough for industrial use
  • Keywords
    data preparation; learning (artificial intelligence); neural nets; correct output data; data preparation; industrial environment; input data; learning convergence; neural networks training; nonlinear variables; training data set; Convergence; Databases; Karhunen-Loeve transforms; Multidimensional systems; Neural networks; Physics; Sampling methods; Statistical distributions; Testing; Training data;
  • fLanguage
    English
  • Journal_Title
    Spectrum, IEEE
  • Publisher
    ieee
  • ISSN
    0018-9235
  • Type

    jour

  • DOI
    10.1109/6.576011
  • Filename
    576011