• DocumentCode
    427108
  • Title

    A new analysis of the value of unlabeled data in semi-supervised learning for image retrieval

  • Author

    Qi Tian ; Yu, Jie ; Xue, Qing ; Sebe, Nicu

  • Author_Institution
    Dept. of Comput. Sci., Texas Univ., San Antonio, TX
  • Volume
    2
  • fYear
    2004
  • fDate
    30-30 June 2004
  • Firstpage
    1019
  • Abstract
    There has been an increasing interest in using unlabeled data in semi-supervised learning for various classification problems. Previous work shows that unlabeled data can improve or degrade the classification performance depending on whether the model assumption matches the ground-truth data distribution, and also on the complexity of the classifier compared with the size of the labeled training set. In this paper, we provide a new analysis on the value of unlabeled data by considering different distributions of the labeled and unlabeled data and showing the migrating effect for semi-supervised learning. Extensive experiments have been performed in the context of image retrieval applications. Our approach evaluates the value of unlabeled data from a new aspect and is aimed to provide a guideline on how unlabeled data should be used
  • Keywords
    image classification; image retrieval; learning (artificial intelligence); statistical distributions; classification performance; classifier complexity; ground-truth data distribution; image retrieval; probabilistic distribution migrating effect; semi-supervised learning; unlabeled data value; Computer science; Degradation; Guidelines; Image analysis; Image retrieval; Information retrieval; Maximum likelihood estimation; Semisupervised learning; Text categorization; Web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    0-7803-8603-5
  • Type

    conf

  • DOI
    10.1109/ICME.2004.1394376
  • Filename
    1394376