• DocumentCode
    1757855
  • Title

    Mining Weakly Labeled Web Facial Images for Search-Based Face Annotation

  • Author

    Dayong Wang ; Hoi, Steven C. H. ; Ying He ; Jianke Zhu

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    26
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    166
  • Lastpage
    179
  • Abstract
    This paper investigates a framework of search-based face annotation (SBFA) by mining weakly labeled facial images that are freely available on the World Wide Web (WWW). One challenging problem for search-based face annotation scheme is how to effectively perform annotation by exploiting the list of most similar facial images and their weak labels that are often noisy and incomplete. To tackle this problem, we propose an effective unsupervised label refinement (ULR) approach for refining the labels of web facial images using machine learning techniques. We formulate the learning problem as a convex optimization and develop effective optimization algorithms to solve the large-scale learning task efficiently. To further speed up the proposed scheme, we also propose a clustering-based approximation algorithm which can improve the scalability considerably. We have conducted an extensive set of empirical studies on a large-scale web facial image testbed, in which encouraging results showed that the proposed ULR algorithms can significantly boost the performance of the promising SBFA scheme.
  • Keywords
    Internet; approximation theory; convex programming; data mining; image retrieval; learning (artificial intelligence); pattern clustering; SBFA; ULR approach; WWW; World Wide Web; clustering-based approximation algorithm; convex optimization; large-scale Web facial image testbed; large-scale learning task; machine learning techniques; search-based face annotation; unsupervised label refinement approach; weakly labeled Web facial image mining; Approximation algorithms; Face; Feature extraction; Humans; Machine learning; Noise measurement; Optimization; Face annotation; content-based image retrieval; label refinement; machine learning; weak label; web facial images;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.240
  • Filename
    6381412