• DocumentCode
    3695133
  • Title

    Paragraph text segmentation into lines with Recurrent Neural Networks

  • Author

    Bastien Moysset;Christopher Kermorvant;Christian Wolf;Jérôme Louradour

  • Author_Institution
    A2iA SA, Paris, France
  • fYear
    2015
  • Firstpage
    456
  • Lastpage
    460
  • Abstract
    The detection of text lines, as a first processing step, is critical in all text recognition systems. State-of-the-art methods to locate lines of text are based on handcrafted heuristics fine-tuned by the image processing community´s experience. They succeed under certain constraints; for instance the background has to be roughly uniform. We propose to use more “agnostic” Machine Learning-based approaches to address text line location. The main motivation is to be able to process either damaged documents, or flows of documents with a high variety of layouts and other characteristics. A new method is presented in this work, inspired by the latest generation of optical models used for text recognition, namely Recurrent Neural Networks. As these models are sequential, a column of text lines in our application plays here the same role as a line of characters in more traditional text recognition settings. A key advantage of the proposed method over other data-driven approaches is that compiling a training dataset does not require labeling line boundaries: only the number of lines are required for each paragraph. Experimental results show that our approach gives similar or better results than traditional handcrafted approaches, with little engineering efforts and less hyper-parameter tuning.
  • Keywords
    "Training","Tuning","Handwriting recognition","Text analysis","Photonics"
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333803
  • Filename
    7333803