Author_Institution :
Sch. of Electr. Eng. & Autom., Harbin Inst. of Technol., Harbin, China
Abstract :
The tracking and recognition of facial activities from images or videos have attracted great attention in computer vision field. Facial activities are characterized by three levels. First, in the bottom level, facial feature points around each facial component, i.e., eyebrow, mouth, etc., capture the detailed face shape information. Second, in the middle level, facial action units, defined in the facial action coding system, represent the contraction of a specific set of facial muscles, i.e., lid tightener, eyebrow raiser, etc. Finally, in the top level, six prototypical facial expressions represent the global facial muscle movement and are commonly used to describe the human emotion states. In contrast to the mainstream approaches, which usually only focus on one or two levels of facial activities, and track (or recognize) them separately, this paper introduces a unified probabilistic framework based on the dynamic Bayesian network to simultaneously and coherently represent the facial evolvement in different levels, their interactions and their observations. Advanced machine learning methods are introduced to learn the model based on both training data and subjective prior knowledge. Given the model and the measurements of facial motions, all three levels of facial activities are simultaneously recognized through a probabilistic inference. Extensive experiments are performed to illustrate the feasibility and effectiveness of the proposed model on all three level facial activities.
Keywords :
computer vision; face recognition; feature extraction; image coding; image motion analysis; learning (artificial intelligence); shape recognition; advanced machine learning method; computer vision field; dynamic Bayesian network; eyebrow; eyebrow raiser; face shape information; facial action coding system; facial action units; facial activities; facial component; facial evolvement; facial expression recognition; facial feature points; facial motion measurement; facial motion model; global facial muscle movement; human emotion states; lid tightener; mainstream approach; mouth; probabilistic inference; prototypical facial expressions; simultaneous facial feature tracking; subjective prior knowledge; training data and; unified probabilistic framework; Face; Face recognition; Facial features; Gold; Modeling; Mouth; Shape; Bayesian network; expression recognition; facial action unit recognition; facial feature tracking; simultaneous tracking and recognition; Algorithms; Bayes Theorem; Biometric Identification; Face; Facial Expression; Humans; Image Processing, Computer-Assisted;