• DocumentCode
    3324051
  • Title

    Discriminative learning for script recognition

  • Author

    Rashid, Sheikh Faisal ; Shafait, Faisal ; Breuel, Thomas M.

  • Author_Institution
    Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
  • fYear
    2010
  • fDate
    26-29 Sept. 2010
  • Firstpage
    2145
  • Lastpage
    2148
  • Abstract
    Document script recognition is one of the important preprocessing steps in a multilingual optical character recognition (MOCR) system. A MOCR system requires prior knowledge of script to accurately recognize multilingual text in a single document. In multilingual documents two scripts can be mixed together within a single text line. Many existing script recognition methods lack the ability to recognize multiple scripts mixed within a single text line. Besides, these methods usually use script dependent features for script recognition thereby limiting their scope to particularly that script. In this paper we propose a discriminative learning approach for multi-script recognition at connected component level by using a convolutional neural network. The convolutional neural network combines feature extraction and script recognition process in one step and discriminative features for script recognition are extracted and learned as convolutional kernels from raw input. This eliminates the need for manually defining discriminative features for particular scripts. Results show above 95% script recognition accuracy at connected component level on datasets of Greek-Latin, Arabic-Latin multi-script documents and Antiqua-Fraktur documents. The proposed method can be easily adapted to different scripts.
  • Keywords
    authoring languages; feature extraction; neural nets; optical character recognition; convolutional neural network; discriminative learning; discriminative learning approach; document script recognition; feature extraction; multilingual optical character recognition system; Accuracy; Artificial neural networks; Colored noise; Feature extraction; Image color analysis; Training; Convolutional neural network; Discriminative learning; MOCR; Multi-script recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2010 17th IEEE International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4244-7992-4
  • Electronic_ISBN
    1522-4880
  • Type

    conf

  • DOI
    10.1109/ICIP.2010.5650928
  • Filename
    5650928