مرکز منطقه ای اطلاع رساني علوم و فناوري - A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

DocumentCode :

2016329

Title :

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

Author :

Ranzato, Marc Aurelio ; LeCun, Yann

Author_Institution :

New York Univ., New York

Volume :

fYear :

2007

fDate :

23-26 Sept. 2007

Firstpage :

1213

Lastpage :

1217

Abstract :

We describe an unsupervised learning algorithm for extracting sparse and locally shift-invariant features. We also devise a principled procedure for learning hierarchies of invariant features. Each feature detector is composed of a set of trainable convolutional filters followed by a max-pooling layer over non-overlapping windows, and a point-wise sigmoid non-linearity. A second stage of more invariant features is fed with patches provided by the first stage feature extractor, and is trained in the same way. The method is used to pre-train the first four layers of a deep convolutional network which achieves state-of-the-art performance on the MNIST dataset of handwritten digits. The final testing error rate is equal to 0.42%. Preliminary experiments on compression of bitonal document images show very promising results in terms of compression ratio and reconstruction error.

Keywords :

document image processing; feature extraction; filtering theory; unsupervised learning; bitonal document images; compression ratio; convolutional filters; deep convolutional network; feature detector; locally shift invariant feature extractor; reconstruction error; unsupervised learning algorithm; Computer architecture; Computer vision; Decoding; Detectors; Euclidean distance; Feature extraction; Filters; Image coding; Image reconstruction; Unsupervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on

Conference_Location :

Parana

ISSN :

1520-5363

Print_ISBN :

978-0-7695-2822-9

Type :

conf

DOI :

10.1109/ICDAR.2007.4377108

Filename :

4377108

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2016329