• DocumentCode
    2489752
  • Title

    Design and analysis of the WCCI 2010 active learning challenge

  • Author

    Guyon, Isabelle ; Cawley, Gavin ; Dror, Gideon ; Lemaire, Vincent

  • Author_Institution
    Clopinet, Berkeley, CA, USA
  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    We organized a data mining challenge on “active learning” for IJCNN/WCCI 2010, addressing machine learning problems where labeling data is expensive, but large amounts of unlabeled data are available at low cost. Examples include handwriting and speech recognition, document classification, vision tasks, drug design using recombinant molecules and protein engineering. Such problems might be tackled from different angles: learning from unlabeled data or active learning. In the former case, the algorithms must satisfy themselves with the limited amount of labeled data and capitalize on the unlabeled data with semi-supervised learning methods. Several challenges have addressed this problem in the past. In the latter case, the algorithms may place a limited number of queries to get new sample labels. The goal in that case is to optimize the queries and the problem is referred to as active learning. While the problem of active learning is of great importance, organizing a challenge in that area is non trivial. This is the problem we have addressed, and we describe our approach in this paper. The “active learning” challenge is part of the WCCI 2010 competition program (http://www.wcci2010. org/competition-program). The website of the challenge remains open for submission of new methods beyond the termination of the challenge as a resource for students and researchers (http://clopinet.com/al).
  • Keywords
    data mining; document handling; handwriting recognition; learning (artificial intelligence); WCCI 2010 active learning challenge; data mining challenge; document classification; drug design; handwriting recognition; machine learning; semi-supervised learning methods; speech recognition; vision tasks; Data models; Environmental factors; Machine learning; Organisms; Predictive models; Protocols; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2010 International Joint Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-6916-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2010.5596506
  • Filename
    5596506