DocumentCode :
1428223
Title :
Active Learning for Solving the Incomplete Data Problem in Facial Age Classification by the Furthest Nearest-Neighbor Criterion
Author :
Wang, Jian-Gang ; Sung, Eric ; Yau, Wei-Yun
Author_Institution :
Instn. for Infocomm Res., Singapore, Singapore
Volume :
20
Issue :
7
fYear :
2011
fDate :
7/1/2011 12:00:00 AM
Firstpage :
2049
Lastpage :
2062
Abstract :
Facial age classification assigns a face image to one of several predefined age groups. One difficulty in applying learning techniques to the age classification problem is the large amount of labeled training data required. Acquiring such training data is very costly in terms of age progress, privacy, human time, and effort. Although unlabeled face images can be obtained easily, manually labeling them on a large scale and obtaining the ground truth would be expensive. Frugally selecting which unlabeled data to label, so as to quickly reach high classification performance with minimal labeling effort, is a challenging problem. In this paper, we present an active learning approach based on an online incremental bilateral two-dimension linear discriminant analysis (IB2DLDA), which initially learns from a small pool of labeled data and then iteratively selects the most informative samples from the unlabeled set to progressively improve the classifier. Specifically, we propose a novel data selection criterion called the furthest nearest-neighbor (FNN), which generalizes margin-based uncertainty to the multiclass case and is easy to compute, so that the proposed active learning algorithm can handle a large number of classes and large data sizes efficiently. Experiments on the FG-NET and Morph databases, together with a large unlabeled data set, for age categorization problems show that the proposed approach can achieve results comparable to, or even better than, those of a conventionally trained active classifier that requires much more labeling effort. Our IB2DLDA-FNN algorithm can achieve similar results much faster than random selection and with fewer samples for age categorization. It can also achieve results comparable to active SVM but trains much faster than active SVM because kernel methods are not needed. The results on the face recognition database and palmprint/palm vein database showed that our approach can handle problems with a large number of classes. Our contributions in this paper are twofold. First, we proposed the IB2DLDA-FNN, the FNN being our novel idea, as a generic on-line or active learning paradigm. Second, we showed that it can be another viable tool for active learning of facial age range classification.
Keywords :
face recognition; image classification; learning (artificial intelligence); FG-NET; IB2DLDA-FNN algorithm; Morph databases; active learning; age categorization problems; conventionally trained active classifier; data selection criterion; face image classification; face recognition database; facial age classification; facial age range classification; furthest nearest-neighbor criterion; generic on-line paradigm; high classification performance; incomplete data problem; kernel methods; labeled training data; learning techniques; margin-based uncertainty; minimal labeling efforts; multiclass case; online incremental bilateral two-dimension linear discriminant analysis; palmprint/palm vein database; predefined age groups; unlabeled face images; Classification algorithms; Databases; Face; Labeling; Support vector machines; Training; Training data; Active learning; age estimation; furthest nearest neighbor; incomplete data problem; Adolescent; Adult; Age Factors; Algorithms; Artificial Intelligence; Biometric Identification; Child; Child, Preschool; Cluster Analysis; Databases, Factual; Discriminant Analysis; Face; Humans; Infant; Middle Aged;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2011.2106794
Filename :
5688457