• DocumentCode
    1135242
  • Title

    IDD: A Supervised Interval Distance-Based Method for Discretization

  • Author

    Ruiz, Francisco J. ; Angulo, Cecilio ; Agell, Núria

  • Author_Institution
    Dept. of Autom. Control, Tech. Univ. of Catalonia, Vilanova i la Geltru
  • Volume
    20
  • Issue
    9
  • fYear
    2008
  • Firstpage
    1230
  • Lastpage
    1238
  • Abstract
    This article introduces a new method for supervised discretization based on interval distances by using a novel concept of neighbourhood in the target´s space. The method proposed takes into consideration the order of the class attribute, when this exists, so that it can be used with ordinal discrete classes as well as continuous classes, in the case of regression problems. The method has proved to be very efficient in terms of accuracy and faster than the most commonly supervised discretization methods used in the literature. It is illustrated through several examples and a comparison with other standard discretization methods is performed for three public data sets by using two different learning tasks: a decision tree algorithm and SVM for regression.
  • Keywords
    decision trees; regression analysis; support vector machines; IDD; SVM; class attribute; continuous classes; decision tree algorithm; ordinal discrete classes; regression problems; standard discretization methods; supervised discretization; supervised interval distance-based method; Clustering; Interval arithmetic; Mining methods and algorithms; and association rules; classification;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.66
  • Filename
    4492776