DocumentCode
2222352
Title
Distinguishing mislabeled data from correctly labeled data in classifier design
Author
Venkataraman, Sundara ; Metaxas, Dimitris ; Fradkin, Dmitriy ; Kulikowski, Casimir ; Muchnik, Ilya
Author_Institution
Dept. of Comput. Sci., Rutgers Univ., USA
fYear
2004
fDate
15-17 Nov. 2004
Firstpage
668
Lastpage
672
Abstract
We have developed a method for distinguishing between correctly labeled and mislabeled data sampled from video sequences and used in the construction of a facial expression recognition classifier. The novelty of our approach lies in training a single, optimal classifier type (a support vector machine, or SVM) on multiple representations of the data, involving different "discriminating" subspaces. Results of a preliminary study on the discrimination of "high stress" vs. "low stress" facial expression data by this method confirms that our novel approach is able to distinguish subproblems where labeling is highly reliable from those where mislabeling can lead to high error rates. In helping detect data subsamples which yield misleading classification results, the method is also a rapid, highly efficient cross-validated approach for eliminating outliers.
Keywords
face recognition; gesture recognition; image classification; image sampling; image sequences; learning (artificial intelligence); support vector machines; data subsamples; facial expression recognition classifier; mislabeled data; support vector machine; training data; video sequences; Acoustic noise; Bagging; Computer science; Labeling; Neural networks; Speech; Support vector machine classification; Support vector machines; Video sequences; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2004. ICTAI 2004. 16th IEEE International Conference on
ISSN
1082-3409
Print_ISBN
0-7695-2236-X
Type
conf
DOI
10.1109/ICTAI.2004.52
Filename
1374252
Link To Document