Title :
Distinguishing mislabeled data from correctly labeled data in classifier design
Author :
Venkataraman, Sundara ; Metaxas, Dimitris ; Fradkin, Dmitriy ; Kulikowski, Casimir ; Muchnik, Ilya
Author_Institution :
Dept. of Comput. Sci., Rutgers Univ., USA
Abstract :
We have developed a method for distinguishing between correctly labeled and mislabeled data sampled from video sequences and used in the construction of a facial expression recognition classifier. The novelty of our approach lies in training a single, optimal classifier type (a support vector machine, or SVM) on multiple representations of the data, involving different "discriminating" subspaces. Results of a preliminary study on the discrimination of "high stress" vs. "low stress" facial expression data by this method confirms that our novel approach is able to distinguish subproblems where labeling is highly reliable from those where mislabeling can lead to high error rates. In helping detect data subsamples which yield misleading classification results, the method is also a rapid, highly efficient cross-validated approach for eliminating outliers.
Keywords :
face recognition; gesture recognition; image classification; image sampling; image sequences; learning (artificial intelligence); support vector machines; data subsamples; facial expression recognition classifier; mislabeled data; support vector machine; training data; video sequences; Acoustic noise; Bagging; Computer science; Labeling; Neural networks; Speech; Support vector machine classification; Support vector machines; Video sequences; Voting;
Conference_Titel :
Tools with Artificial Intelligence, 2004. ICTAI 2004. 16th IEEE International Conference on
Print_ISBN :
0-7695-2236-X
DOI :
10.1109/ICTAI.2004.52