• DocumentCode
    1374512
  • Title

    Initialization Independent Clustering With Actively Self-Training Method

  • Author

    Nie, Feiping ; Xu, Dong ; Li, Xuelong

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
  • Volume
    42
  • Issue
    1
  • fYear
    2012
  • Firstpage
    17
  • Lastpage
    27
  • Abstract
    The results of traditional clustering methods are usually unreliable as there is not any guidance from the data labels, while the class labels can be predicted more reliable by the semisupervised learning if the labels of partial data are given. In this paper, we propose an actively self-training clustering method, in which the samples are actively selected as training set to minimize an estimated Bayes error, and then explore semisupervised learning to perform clustering. Traditional graph-based semisupervised learning methods are not convenient to estimate the Bayes error; we develop a specific regularization framework on graph to perform semisupervised learning, in which the Bayes error can be effectively estimated. In addition, the proposed clustering algorithm can be readily applied in a semisupervised setting with partial class labels. Experimental results on toy data and real-world data sets demonstrate the effectiveness of the proposed clustering method on the unsupervised and the semisupervised setting. It is worthy noting that the proposed clustering method is free of initialization, while traditional clustering methods are usually dependent on initialization.
  • Keywords
    Bayes methods; graph theory; learning (artificial intelligence); pattern clustering; class labels; data labels; estimated Bayes error minimization; graph-based semisupervised learning methods; initialization independent clustering; selftraining clustering method; semisupervised learning; specific regularization framework; Clustering algorithms; Clustering methods; Laplace equations; Manifolds; Reliability; Semisupervised learning; Vectors; Active learning; initialization independent clustering; self-training; spectral clustering (SC); Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Decision Support Techniques; Models, Theoretical; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2011.2161607
  • Filename
    6078436