• DocumentCode
    3051928
  • Title

    Training set selection using entropy based distance

  • Author

    Kajdanowicz, Tomasz ; Plamowski, Slawomir ; Kazienko, Przemyslaw

  • Author_Institution
    Inst. of Infomratics, Wroclaw Univ. of Technol., Wroclaw, Poland
  • fYear
    2011
  • fDate
    6-8 Dec. 2011
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Distance measures, especially between probability density functions, are essential in solving machine learning problems. Among classification and clustering, data reduction and selection are some of them. In the paper a new distance measure for comparing and selecting training datasets is described. The distance between two datasets is based on variance of entropy in groups obtained by clustering joint datasets. The proposed approach is examined in dataset selection during prediction of debt portfolio value. Finally, basic evaluation on prediction performance is conducted.
  • Keywords
    data reduction; entropy; investment; learning (artificial intelligence); pattern classification; pattern clustering; probability; data reduction; data selection; debt portfolio value; distance measures; entropy based distance; joint dataset clustering; machine learning problems; probability density functions; training set selection; Entropy; Portfolios; Prediction algorithms; Probability density function; Testing; Training; Vectors; dataset selection; debt valuation; distance measures; intelligent systems; prediction methods; supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applied Electrical Engineering and Computing Technologies (AEECT), 2011 IEEE Jordan Conference on
  • Conference_Location
    Amman
  • Print_ISBN
    978-1-4577-1083-4
  • Type

    conf

  • DOI
    10.1109/AEECT.2011.6132530
  • Filename
    6132530