• DocumentCode
    615114
  • Title
    A scalable metric learning-based voting method for expression recognition
  • Author
    Shaohua Wan; Aggarwal, J.K.
  • Author_Institution
    Dept. of ECE, Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2013
  • fDate
    22-26 April 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    In this paper, we propose a facial expression classification method using metric learning-based k-nearest neighbor voting. To classify a facial expression accurately from a frontal face image, we first learn a distance metric from training data that characterizes the structure of the feature space, then use this metric to retrieve nearest neighbors from the training dataset, and finally output the classification decision based on their votes. An expression is represented as a fusion of face shape and texture. This representation is based on registering a face image with a landmarking shape model and extracting Gabor features from local patches around the landmarks. It achieves robustness and effectiveness by using an ensemble of local patch feature detectors at a global shape level. A naive implementation of metric learning-based k-nearest neighbor search would incur a time complexity proportional to the size of the training dataset, which precludes its use with very large datasets. To scale to potentially larger databases, we further devise an approximate yet efficient variant of ML-based kNN voting based on Locality Sensitive Hashing (LSH). A query example is hashed directly to a bucket of a pre-computed hash table in which candidate nearest neighbors can be found, so there is no need to search the entire database. Experimental results on the Cohn-Kanade database and the Moving Faces and People database show that both ML-based kNN voting and its LSH approximation outperform the state of the art, demonstrating the superiority and scalability of our method.
  • Keywords
    Gabor filters; approximation theory; computational complexity; emotion recognition; face recognition; feature extraction; image classification; image fusion; image registration; image representation; image texture; Cohn-Kanade database; Gabor feature extraction; LSH approximation; ML-based kNN voting; classification decision; distance metric structure; expression recognition; expression representation; face image registration; face shape fusion; facial expression classification method; frontal face image; landmarking shape model; local patch feature detector; locality sensitive hashing; metric learning-based k-nearest neighbor voting; moving face database; moving people database; naive implementation; nearest neighbor retrieval; texture fusion; time complexity; Databases; Face; Feature extraction; Measurement; Shape; Training; Vectors; Gabor feature; K-Nearest Neighbor; Locality sensitive Hashing; Metric Learning; emotion recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-5545-2
  • Electronic_ISBN
    978-1-4673-5544-5
  • Type
    conf
  • DOI
    10.1109/FG.2013.6553753
  • Filename
    6553753
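
The pipeline summarized in the abstract (project features with a learned metric, hash the query to a bucket, vote among the candidates found there) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation. It assumes the learned metric is supplied as a linear transform `L` (so that the metric matrix is `L^T L`), and it swaps in a standard p-stable (random projection) hash family, since the record does not say which hash functions the paper uses. The class name `LSHKnnVoter` and all of its parameters are hypothetical.

```python
from collections import Counter, defaultdict

import numpy as np


class LSHKnnVoter:
    """Approximate kNN voting under a linear (Mahalanobis-style) metric.

    Hypothetical helper: the metric is assumed to be given as a matrix L,
    and the hash family (random projections with quantization) is an
    assumption, not taken from the paper.
    """

    def __init__(self, transform, n_hashes=8, bucket_width=4.0, seed=0):
        self.transform = np.asarray(transform, dtype=float)
        rng = np.random.default_rng(seed)
        dim = self.transform.shape[0]  # dimension after applying L
        self.directions = rng.normal(size=(n_hashes, dim))
        self.offsets = rng.uniform(0.0, bucket_width, size=n_hashes)
        self.bucket_width = bucket_width
        self.table = defaultdict(list)

    def _project(self, x):
        # Euclidean distance after this projection equals the learned metric.
        return np.asarray(x, dtype=float) @ self.transform.T

    def _key(self, z):
        # Quantized random projections form the bucket key.
        cells = np.floor((self.directions @ z + self.offsets) / self.bucket_width)
        return tuple(cells.astype(int))

    def fit(self, features, labels):
        # Pre-compute the hash table once over the whole training set.
        self.points = self._project(features)
        self.labels = list(labels)
        for index, z in enumerate(self.points):
            self.table[self._key(z)].append(index)
        return self

    def predict(self, x, k=3):
        z = self._project(x)
        candidates = self.table.get(self._key(z))
        if not candidates:
            # Empty bucket: fall back to an exact scan of the training set.
            candidates = range(len(self.labels))
        candidates = np.fromiter(candidates, dtype=int)
        dists = np.linalg.norm(self.points[candidates] - z, axis=1)
        nearest = candidates[np.argsort(dists)[:k]]
        # Majority vote among the retrieved neighbors.
        return Counter(self.labels[i] for i in nearest).most_common(1)[0][0]
```

A single hash table trades recall for speed. Practical LSH schemes usually keep several independent tables and merge their candidate sets, which this sketch omits for brevity.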