DocumentCode
2535522
Title
Learning bilinear models for two-factor problems in vision
Author
Freeman, W.T. ; Tenenbaum, J.B.
Author_Institution
MERL, Cambridge, MA, USA
fYear
1997
fDate
17-19 Jun 1997
Firstpage
554
Lastpage
560
Abstract
In many vision problems, we want to infer two (or more) hidden factors which interact to produce our observations. We may want to disentangle illuminant and object colors in color constancy; rendering conditions from surface shape in shape-from-shading; face identity and head pose in face recognition; or font and letter class in character recognition. We refer to these two factors generically as “style” and “content”. Bilinear models offer a powerful framework for extracting the two-factor structure of a set of observations, and are familiar in computational vision from several well-known lines of research. This paper shows how bilinear models can be used to learn the style-content structure of a pattern analysis or synthesis problem, which can then be generalized to solve related tasks using different styles and/or content. We focus on three tasks: extrapolating the style of data to unseen content classes, classifying data with known content under a novel style, and translating data from novel content classes and style to a known style or content. We show examples from color constancy, face pose estimation, shape-from-shading, typography and speech
Keywords
computer vision; image recognition; learning (artificial intelligence); bilinear models; character recognition; color constancy; computational vision; face identity; face recognition; head pose; novel style; pattern analysis; rendering conditions; shape-from-shading; style-content structure; surface shape; two-factor problems; unseen content classes; vision problems; Character recognition; Computer vision; Extrapolation; Face detection; Face recognition; Head; Image recognition; Pattern analysis; Shape; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 1997. Proceedings., 1997 IEEE Computer Society Conference on
Conference_Location
San Juan
ISSN
1063-6919
Print_ISBN
0-8186-7822-4
Type
conf
DOI
10.1109/CVPR.1997.609380
Filename
609380
Link To Document