DocumentCode :
2919227
Title :
Learning better image representations using ‘flobject analysis’
Author :
Li, Patrick S. ; Givoni, Inmar E. ; Frey, Brendan J.
Author_Institution :
Univ. of Toronto, Toronto, ON, Canada
fYear :
2011
fDate :
20-25 June 2011
Firstpage :
2721
Lastpage :
2728
Abstract :
Unsupervised learning can be used to extract image representations that are useful for various and diverse vision tasks. After noticing that most biological vision systems for interpreting static images are trained using disparity information, we developed an analogous framework for unsupervised learning. The output of our method is a model that can generate a vector representation or descriptor from any static image. However, the model is trained using pairs of consecutive video frames, which are used to find representations that are consistent with optical flow-derived objects, or `flobjects´. To demonstrate the flobject analysis framework, we extend the latent Dirichlet allocation bag-of-words model to account for real-valued word-specific flow vectors and image-specific probabilistic associations between flow clusters and topics. We show that the static image representations extracted using our method can be used to achieve higher classification rates and better generalization than standard topic models, spatial pyramid matching and gist descriptors.
Keywords :
computer vision; feature extraction; image representation; image sequences; pattern clustering; unsupervised learning; biological vision systems; disparity information; flobject analysis; flow clusters; gist descriptors; image representation extraction; image-specific probabilistic associations; latent Dirichlet allocation bag-of-words model; optical flow-derived objects; real-valued word-specific flow vectors; spatial pyramid matching; standard topic models; static images; unsupervised learning; vector representation; video frames; Analytical models; Feature extraction; Histograms; Kernel; Optical imaging; Support vector machines; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4577-0394-2
Type :
conf
DOI :
10.1109/CVPR.2011.5995649
Filename :
5995649
Link To Document :
بازگشت