• DocumentCode
    3134966
  • Title

    Statistical Text Line Analysis in Handwritten Documents

  • Author

    Bosch, Vicente ; Toselli, Alejandro Hector ; Vidal, Enrique

  • Author_Institution
    Inst. Tec. de Inf., Univ. Politec. Valencia, Valencia, Spain
  • fYear
    2012
  • fDate
    18-20 Sept. 2012
  • Firstpage
    201
  • Lastpage
    206
  • Abstract
    In this paper we present an approach for text line analysis and detection in handwritten documents based on Hidden Markov Models, a technique widely used in other handwritten and speech recognition tasks. It is shown that text line analysis and detection can be solved using a more formal methodology in contraposition to most of the proposed heuristic approaches found in the literature. Our approach not only provides the best position coordinates for each of the vertical page regions but also labels them, in this manner surpassing the traditional heuristic methods. In our experiments we demonstrate the performance of the approach (both in line analysis and detection) and study the impact of increasingly constrained "vertical layout language models" on text line detection accuracy. Through this experimentation we also show the improvement in quality of the baselines yielded by our approach in comparison with a state-of-the-art heuristic method based on vertical projection profiles.
  • Keywords
    document image processing; handwritten character recognition; hidden Markov models; statistical analysis; text detection; handwritten document; hidden Markov model; position coordinate; statistical text line analysis; text line detection; vertical layout language model; vertical page region; vertical projection profile; Accuracy; Feature extraction; Grammar; Hidden Markov models; Text recognition; Training; Vectors; Handwritten Text; Hidden Markov Models; Text line analysis; Text line detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
  • Conference_Location
    Bari
  • Print_ISBN
    978-1-4673-2262-1
  • Type

    conf

  • DOI
    10.1109/ICFHR.2012.274
  • Filename
    6424392