• DocumentCode
    3004490
  • Title

    Learning from ambiguously labeled images

  • Author

    Cour, Timothee ; Sapp, Brian ; Jordan, Christopher ; Taskar, Ben

  • Author_Institution
    Univ. of Pennsylvania, Philadelphia, PA, USA
  • fYear
    2009
  • fDate
    20-25 June 2009
  • Firstpage
    919
  • Lastpage
    926
  • Abstract
    In many image and video collections, we have access only to partially labeled data. For example, personal photo collections often contain several faces per image and a caption that only specifies who is in the picture, but not which name matches which face. Similarly, movie screenplays can tell us who is in the scene, but not when and where they are on the screen. We formulate the learning problem in this setting as partially-supervised multiclass classification where each instance is labeled ambiguously with more than one label. We show theoretically that effective learning is possible under reasonable assumptions even when all the data is weakly labeled. Motivated by the analysis, we propose a general convex learning formulation based on minimization of a surrogate loss appropriate for the ambiguous label setting. We apply our framework to identifying faces culled from Web news sources and to naming characters in TV series and movies. We experiment on a very large dataset consisting of 100 hours of video, and in particular achieve 6% error for character naming on 16 episodes of LOST.
  • Keywords
    image classification; learning (artificial intelligence); minimisation; video signal processing; Web news source; ambiguously labeled image; convex learning formulation; learning problem; partially-supervised multiclass classification; surrogate loss appropriate minimization; video collection; Layout; Motion pictures; TV;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-3992-8
  • Type

    conf

  • DOI
    10.1109/CVPR.2009.5206667
  • Filename
    5206667