• DocumentCode
    2211044
  • Title

    KB-CB-N classification: Towards unsupervised approach for supervised learning

  • Author

    Abdallah, Zahraa Said ; Gaber, Mohamed Medhat

  • Author_Institution
    Centre for Distrib. Syst. & Software Eng., Monash Univ., Caulfield East, VIC, Australia
  • fYear
    2011
  • fDate
    11-15 April 2011
  • Firstpage
    283
  • Lastpage
    290
  • Abstract
    Data classification has attracted considerable research attention in the field of computational statistics and data mining due to its wide range of applications. K Best Cluster Based Neighbour (KB-CB-N) is our novel classification technique based on the integration of three different similarity measures for cluster based classification. The basic principle is to apply unsupervised learning on the instances of each class in the dataset and then use the output as an input for the classification algorithm to find the K best neighbours of clusters from the density, gravity and distance perspectives. Clustering is applied as an initial step within each class to find the inherent in-class grouping in the dataset. Different data clustering techniques use different similarity measures. Each measure has its own strength and weakness. Thus, combining the three measures can benefit from the strength of each one and eliminate encountered problems of using an individual measure. Extensive experimental results using eight real datasets have evidenced that our new technique typically shows improved or equivalent performance over other existing state-of-the-art classification methods.
  • Keywords
    data mining; pattern classification; pattern clustering; statistical analysis; unsupervised learning; K best cluster based neighbour classification; KB-CB-N classification; classification algorithm; computational statistics; data classification; data clustering; data mining; in-class grouping; similarity measures; unsupervised learning; Classification algorithms; Clustering algorithms; Density measurement; Euclidean distance; Gravity; Prediction algorithms; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining (CIDM), 2011 IEEE Symposium on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-9926-7
  • Type

    conf

  • DOI
    10.1109/CIDM.2011.5949435
  • Filename
    5949435