مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning methods for generic object recognition with invariance to pose and lighting

DocumentCode :

3207772

Title :

Learning methods for generic object recognition with invariance to pose and lighting

Author :

LeCun, Yann ; Huang, Fu Jie ; Bottou, Léon

Author_Institution :

Courant Inst., New York Univ., NY, USA

Volume :

fYear :

2004

fDate :

27 June-2 July 2004

Abstract :

We assess the applicability of several popular learning methods for the problem of recognizing generic visual categories with invariance to pose, lighting, and surrounding clutter. A large dataset comprising stereo image pairs of 50 uniform-colored toys under 36 azimuths, 9 elevations, and 6 lighting conditions was collected (for a total of 194,400 individual images). The objects were 10 instances of 5 generic categories: four-legged animals, human figures, airplanes, trucks, and cars. Five instances of each category were used for training, and the other five for testing. Low-resolution grayscale images of the objects with various amounts of variability and surrounding clutter were used for training and testing. Nearest neighbor methods, support vector machines, and convolutional networks, operating on raw pixels or on PCA-derived features were tested. Test error rates for unseen object instances placed on uniform backgrounds were around 13% for SVM and 7% for convolutional nets. On a segmentation/recognition task with highly cluttered images, SVM proved impractical, while convolutional nets yielded 16/7% error. A real-time version of the system was implemented that can detect and classify objects in natural scenes at around 10 frames per second.

Keywords :

computer vision; feature extraction; image segmentation; object detection; principal component analysis; stereo image processing; support vector machines; very large databases; visual databases; convolutional networks; generic object recognition; image segmentation; learning methods; lighting invariance; low-resolution grayscale images; nearest neighbor methods; pose invariance; support vector machines; test error rates; Airplanes; Animals; Azimuth; Gray-scale; Humans; Learning systems; Object recognition; Support vector machine classification; Support vector machines; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on

ISSN :

1063-6919

Print_ISBN :

0-7695-2158-4

Type :

conf

DOI :

10.1109/CVPR.2004.1315150

Filename :

1315150

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3207772