Title :
Optimal data partition for semi-automated labeling
Author :
Lopresti, Daniel ; Nagy, G.
Author_Institution :
Lehigh Univ., Bethlehem, PA, USA
Abstract :
In a pattern recognition sequence consisting of alternating steps of interactive labeling, classifier training, and automated labeling (e.g., CAVIAR systems), the choice of sample size at each step affects the overall amount of human interaction necessary to label all the samples correctly. The appropriate splits depend on the error rate of the classifier as a function of the size of the training set and, perhaps surprisingly, are independent of the relative costs of interactive correction and confirmation. We model such a system and report the sequence of optimal data partitions for a representative range of parameters.
Keywords :
error analysis; human computer interaction; pattern classification; error rate; interactive labeling; optimal data partitions; pattern classifier; pattern recognition sequence; semiautomated labeling; training set; Computational modeling; Computers; Error analysis; Humans; Labeling; Pattern recognition; Training;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4