DocumentCode :
178477
Title :
Convolutional Neural Networks for Document Image Classification
Author :
Le Kang ; Kumar, Jayant ; Peng Ye ; Yi Li ; Doermann, David
Author_Institution :
Univ. of Maryland, College Park, MD, USA
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
3168
Lastpage :
3172
Abstract :
This paper presents a Convolutional Neural Network (CNN) for document image classification. In particular, document image classes are defined by the structural similarity. Previous approaches rely on hand-crafted features for capturing structural information. In contrast, we propose to learn features from raw image pixels using CNN. The use of CNN is motivated by the the hierarchical nature of document layout. Equipped with rectified linear units and trained with dropout, our CNN performs well even when document layouts present large inner-class variations. Experiments on public challenging datasets demonstrate the effectiveness of the proposed approach.
Keywords :
document image processing; feature extraction; image classification; neural nets; CNN; convolutional neural networks; document image classification; document layout; hand-crafted features; inner-class variations; public challenging datasets; raw image pixels; rectified linear units; structural similarity; Accuracy; Computer vision; Kernel; Layout; NIST; Pattern recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.546
Filename :
6977258
Link To Document :
بازگشت