• DocumentCode
    3695263
  • Title

    Deepdocclassifier: Document classification with deep Convolutional Neural Network

  • Author

    Muhammad Zeshan Afzal;Samuele Capobianco;Muhammad Imran Malik;Simone Marinai;Thomas M. Breuel;Andreas Dengel;Marcus Liwicki

  • Author_Institution
    Kaiserslautern University of Technology, Germany
  • fYear
    2015
  • Firstpage
    1111
  • Lastpage
    1115
  • Abstract
    This paper presents a deep Convolutional Neural Network (CNN) based approach for document image classification. One of the main requirement of deep CNN architecture is that they need huge number of samples for training. To overcome this problem we adopt a deep CNN which is trained using big image dataset containing millions of samples i.e., ImageNet. The proposed work outperforms both the traditional structure similarity methods and the CNN based approaches proposed earlier. The accuracy of the proposed approach with merely 20 images per class outperforms the state-of-the-art by achieving classification accuracy of 68.25%. The best results on Tobbacoo-3428 dataset show that our proposed method outperforms the state-of-the-art method by a significant margin and achieved a median accuracy of 77.6% with 100 samples per class used for training and validation.
  • Keywords
    "Marine vehicles","Convolutional codes","Electronic mail"
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333933
  • Filename
    7333933