DocumentCode
2489752
Title
Design and analysis of the WCCI 2010 active learning challenge
Author
Guyon, Isabelle ; Cawley, Gavin ; Dror, Gideon ; Lemaire, Vincent
Author_Institution
Clopinet, Berkeley, CA, USA
fYear
2010
fDate
18-23 July 2010
Firstpage
1
Lastpage
8
Abstract
We organized a data mining challenge on “active learning” for IJCNN/WCCI 2010, addressing machine learning problems where labeling data is expensive, but large amounts of unlabeled data are available at low cost. Examples include handwriting and speech recognition, document classification, vision tasks, drug design using recombinant molecules and protein engineering. Such problems might be tackled from different angles: learning from unlabeled data or active learning. In the former case, the algorithms must satisfy themselves with the limited amount of labeled data and capitalize on the unlabeled data with semi-supervised learning methods. Several challenges have addressed this problem in the past. In the latter case, the algorithms may place a limited number of queries to get new sample labels. The goal in that case is to optimize the queries and the problem is referred to as active learning. While the problem of active learning is of great importance, organizing a challenge in that area is non trivial. This is the problem we have addressed, and we describe our approach in this paper. The “active learning” challenge is part of the WCCI 2010 competition program (http://www.wcci2010. org/competition-program). The website of the challenge remains open for submission of new methods beyond the termination of the challenge as a resource for students and researchers (http://clopinet.com/al).
Keywords
data mining; document handling; handwriting recognition; learning (artificial intelligence); WCCI 2010 active learning challenge; data mining challenge; document classification; drug design; handwriting recognition; machine learning; semi-supervised learning methods; speech recognition; vision tasks; Data models; Environmental factors; Machine learning; Organisms; Predictive models; Protocols; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location
Barcelona
ISSN
1098-7576
Print_ISBN
978-1-4244-6916-1
Type
conf
DOI
10.1109/IJCNN.2010.5596506
Filename
5596506
Link To Document