• Title of article

    A systematic analysis of performance measures for classification tasks

  • Author/Authors

    Marina Sokolova، نويسنده , , Leila Kosseim & Guy Lapalme، نويسنده ,

  • Issue Information
    دوماهنامه با شماره پیاپی سال 2009
  • Pages
    11
  • From page
    427
  • To page
    437
  • Abstract
    This paper presents a systematic analysis of twenty four performance measures used in the complete spectrum of Machine Learning classification tasks, i.e., binary, multi-class, multi-labelled, and hierarchical. For each classification task, the study relates a set of changes in a confusion matrix to specific characteristics of data. Then the analysis concentrates on the type of changes to a confusion matrix that do not change a measure, therefore, preserve a classifier’s evaluation (measure invariance). The result is the measure invariance taxonomy with respect to all relevant label distribution changes in a classification problem. This formal analysis is supported by examples of applications where invariance properties of measures lead to a more reliable evaluation of classifiers. Text classification supplements the discussion with several case studies.
  • Keywords
    Text classification , Performance Evaluation , Machine Learning
  • Journal title
    Information Processing and Management
  • Serial Year
    2009
  • Journal title
    Information Processing and Management
  • Record number

    1228954