• Title of article

    Segmentation-free word spotting with exemplar SVMs

  • Author/Authors

    Almazلn، نويسنده , , Jon and Gordo، نويسنده , , Albert and Fornés، نويسنده , , Alicia and Valveny، نويسنده , , Ernest، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2014
  • Pages
    12
  • From page
    3967
  • To page
    3978
  • Abstract
    In this paper we propose an unsupervised segmentation-free method for word spotting in document images. Documents are represented with a grid of HOG descriptors, and a sliding-window approach is used to locate the document regions that are most similar to the query. We use the Exemplar SVM framework to produce a better representation of the query in an unsupervised way. Then, we use a more discriminative representation based on Fisher Vector to rerank the best regions retrieved, and the most promising ones are used to expand the Exemplar SVM training set and improve the query representation. Finally, the document descriptors are precomputed and compressed with Product Quantization. This offers two advantages: first, a large number of documents can be kept in RAM memory at the same time. Second, the sliding window becomes significantly faster since distances between quantized HOG descriptors can be precomputed. Our results significantly outperform other segmentation-free methods in the literature, both in accuracy and in speed and memory usage.
  • Keywords
    Word spotting , Segmentation-free , unsupervised learning , Reranking , Query expansion , Compression
  • Journal title
    PATTERN RECOGNITION
  • Serial Year
    2014
  • Journal title
    PATTERN RECOGNITION
  • Record number

    1736727