Title :
Active learning for classification: An optimistic approach
Author :
Collet, Timothe ; Pietquin, Olivier
Author_Institution :
MaLIS Res. Group, Supelec, Gif-Sur-Yvette, France
Abstract :
In this paper, we propose to reformulate the active learning problem occurring in classification as a sequential decision making problem. We particularly focus on the problem of dynamically allocating a fixed budget of samples. This raises the problem of the trade off between exploration and exploitation which is traditionally addressed in the framework of the multi-armed bandits theory. Based on previous work on bandit theory applied to active learning for regression, we introduce four novel algorithms for solving the online allocation of the budget in a classification problem. Experiments on a generic classification problem demonstrate that these new algorithms compare positively to state-of-the-art methods.
Keywords :
decision making; learning (artificial intelligence); optimisation; pattern classification; regression analysis; active learning; classification; multiarmed bandits theory; optimistic approach; regression; sequential decision making problem; Algorithm design and analysis; Noise; Noise measurement; Partitioning algorithms; Resource management; Shape; Uncertainty;
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/ADPRL.2014.7010610