• DocumentCode
    3695174
  • Title

    Segmentation-free query-by-string word spotting with Bag-of-Features HMMs

  • Author

    Leonard Rothacker;Gernot A. Fink

  • Author_Institution
    Department of Computer Science, TU Dortmund University, 44221, Germany
  • fYear
    2015
  • Firstpage
    661
  • Lastpage
    665
  • Abstract
    Word spotting allows to explore document images without requiring a full transcription. In the query-by-string scenario considered in this paper, it is possible to search arbitrary keywords while only limited prior information about the documents is required. We learn context-dependent character models from a training set that is small with respect to the number of models. This is possible due to the use of Bag-of-Features HMMs that are especially suited for estimating robust models from limited training material. In contrast to most query-by-string methods we consider a fully segmentation-free decoding framework that does not require any pre-segmentation on word or line level. Experiments on the well-known George Washington benchmark demonstrate the high accuracy of our method.
  • Keywords
    "Hidden Markov models","Image segmentation","Robustness","Image recognition","Computational modeling","Context modeling","Context"
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333844
  • Filename
    7333844