• DocumentCode
    2916676
  • Title

    Managing stochastic algorithms cross-validation variability using an interval valued multiple comparison procedure

  • Author

    Otero, Jose ; Sanchez, Luciano ; Palacios, Ana Maria ; Couso, Ines

  • Author_Institution
    Comput. Sci. Dept., Oviedo Univ., Oviedo, Spain
  • fYear
    2011
  • fDate
    22-24 Nov. 2011
  • Firstpage
    1391
  • Lastpage
    1396
  • Abstract
    The usual procedure to compare metaheuristics or evolutionary algorithms using cross validation is repeating the training stage several times for each train/test pair. In general, the results of the different repetitions are not independent and this practice is questionable. In this work, it is suggested to represent the results of each train/test set pair by an interval or by a fuzzy set and use the proper statistical tests to these kind of data. In this way, the paired comparison of these sets leads to an interval valued p-value or to a fuzzy p-value that holds information about the mean differences but also about the differences in dispersion between the compared algorithms.
  • Keywords
    evolutionary computation; fuzzy set theory; learning (artificial intelligence); statistical analysis; stochastic processes; evolutionary algorithm; fuzzy p-value; fuzzy set; interval valued multiple comparison procedure; interval valued p-value; metaheuristics; statistical tests; stochastic algorithm cross-validation variability; train-test set pair; Algorithm design and analysis; Classification algorithms; Diamond-like carbon; Dispersion; Error analysis; Machine learning algorithms; Solids; Experimental design; cross validation; fuzzy statistics; stochastic algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
  • Conference_Location
    Cordoba
  • ISSN
    2164-7143
  • Print_ISBN
    978-1-4577-1676-8
  • Type

    conf

  • DOI
    10.1109/ISDA.2011.6121854
  • Filename
    6121854