DocumentCode
639549
Title
Sparse Output Coding for Large-Scale Visual Recognition
Author
Bin Zhao ; Xing, Eric P.
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2013
fDate
23-28 June 2013
Firstpage
3350
Lastpage
3357
Abstract
Many vision tasks require a multi-class classifier to discriminate multiple categories, on the order of hundreds or thousands. In this paper, we propose sparse output coding, a principled way for large-scale multi-class classification, by turning high-cardinality multi-class categorization into a bit-by-bit decoding problem. Specifically, sparse output coding is composed of two steps: efficient coding matrix learning with scalability to thousands of classes, and probabilistic decoding. Empirical results on object recognition and scene classification demonstrate the effectiveness of our proposed approach.
Keywords
decoding; image classification; learning (artificial intelligence); matrix algebra; object recognition; probability; bit-by-bit decoding problem; coding matrix learning; high-cardinality multiclass categorization; large-scale multiclass classification; large-scale visual recognition; multiclass classifier; object recognition; probabilistic decoding; scene classification; sparse output coding; Accuracy; Decoding; Encoding; Gradient methods; Training data; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location
Portland, OR
ISSN
1063-6919
Type
conf
DOI
10.1109/CVPR.2013.430
Filename
6619274
Link To Document