مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning Expressionlets on Spatio-temporal Manifold for Dynamic Facial Expression Recognition

DocumentCode :

253931

Title :

Learning Expressionlets on Spatio-temporal Manifold for Dynamic Facial Expression Recognition

Author :

Mengyi Liu ; Shiguang Shan ; Ruiping Wang ; Xilin Chen

Author_Institution :

Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China

fYear :

2014

fDate :

23-28 June 2014

Firstpage :

1749

Lastpage :

1756

Abstract :

Facial expression is temporally dynamic event which can be decomposed into a set of muscle motions occurring in different facial regions over various time intervals. For dynamic expression recognition, two key issues, temporal alignment and semantics-aware dynamic representation, must be taken into account. In this paper, we attempt to solve both problems via manifold modeling of videos based on a novel mid-level representation, i.e. expressionlet. Specifically, our method contains three key components: 1) each expression video clip is modeled as a spatio-temporal manifold (STM) formed by dense low-level features, 2) a Universal Manifold Model (UMM) is learned over all low-level features and represented as a set of local ST modes to statistically unify all the STMs. 3) the local modes on each STM can be instantiated by fitting to UMM, and the corresponding expressionlet is constructed by modeling the variations in each local ST mode. With above strategy, expression videos are naturally aligned both spatially and temporally. To enhance the discriminative power, the expressionlet-based STM representation is further processed with discriminant embedding. Our method is evaluated on four public expression databases, CK+, MMI, Oulu-CASIA, and AFEW. In all cases, our method reports results better than the known state-of-the-art.

Keywords :

emotion recognition; face recognition; video signal processing; visual databases; AFEW databases; CK+ databases; MMI databases; Oulu-CASIA databases; UMM; dense low-level features; dynamic facial expression recognition; expressionlet-based STM representation; facial regions; local ST modes; manifold modeling; mid-level representation; muscle motions; semantic-aware dynamic representation; spatiotemporal manifold; temporally dynamic event; universal manifold model; video clip; video manifold modeling; Covariance matrices; Databases; Face recognition; Feature extraction; Manifolds; Three-dimensional displays; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

Conference_Location :

Columbus, OH

Type :

conf

DOI :

10.1109/CVPR.2014.226

Filename :

6909622

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=253931