  • DocumentCode
    2142314
  • Title
    Fast Key-Word Searching via Embedding and Active-DTW
  • Author
    Saabni, Raid; Bronstein, Alex
  • Author_Institution
    Triangle R&D Center, Tel-Aviv Univ., Tel-Aviv, Israel
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    68
  • Lastpage
    72
  • Abstract
    In this paper we present a novel approach for fast search of handwritten Arabic word-parts within large lexicons. The algorithm proceeds in three steps. First, it warps multiple appearances of each word-part in the lexicon so that they can be embedded into the same Euclidean space. The embedding is based on the warping path produced by the Dynamic Time Warping (DTW) process while calculating the similarity distance. In the second step, all samples of the different word-parts are resampled uniformly to the same size, and a kd-tree is used to store all shapes representing word-parts in the lexicon. A fast approximate k-nearest-neighbor search then generates a short list of candidates to be passed to the next step. In the third step, the Active-DTW [15] algorithm examines each sample in the short list and gives the final, accurate results. We demonstrate our method on a database of 23,500 images of word-parts extracted from the IFN/ENIT database [6] and 22,000 images collected from 93 writers. Our method achieves a speedup of 5 orders of magnitude over the exact method, at the cost of only a 3.8% reduction in accuracy.
  • Keywords
    feature extraction; handwriting recognition; Euclidean space; active-DTW; dynamic time warping; embedding process; handwritten Arabic word-part; k-nearest neighbor approximation; kd-tree structure; key-word searching; word-part image; Approximation methods; Artificial neural networks; Databases; Handwriting recognition; Libraries; Shape; Vectors; Dynamic Time Warping; Embedding; Handwriting Recognition; Nearest Neighbor; Word Searching;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type
    conf
  • DOI
    10.1109/ICDAR.2011.23
  • Filename
    6065278
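
The coarse-then-fine search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and makes several simplifying assumptions: shapes are plain 1-D sequences, the embedding is only uniform resampling (the warping-path alignment of the first step is omitted), a linear scan stands in for the approximate kd-tree query, and plain DTW stands in for Active-DTW in the re-ranking step. The names `dtw`, `resample`, `search`, and the toy lexicon are hypothetical.

```python
import math


def dtw(a, b):
    """Classic dynamic time warping cost between two 1-D sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # d[i][j] = minimal cost of aligning a[:i] with b[:j]
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]


def resample(seq, size):
    """Linearly interpolate seq to exactly `size` points (uniform resampling)."""
    if size == 1 or len(seq) == 1:
        return [float(seq[0])] * size
    out = []
    for k in range(size):
        pos = k * (len(seq) - 1) / (size - 1)
        lo = int(math.floor(pos))
        hi = min(lo + 1, len(seq) - 1)
        frac = pos - lo
        out.append(seq[lo] * (1 - frac) + seq[hi] * frac)
    return out


def search(query, lexicon, size=8, shortlist=2):
    """Two-stage search: cheap Euclidean short list, then DTW re-ranking."""
    q = resample(query, size)
    # Stage 1 (assumption: linear scan instead of an approximate kd-tree query).
    scored = [(math.dist(q, resample(s, size)), label, s) for label, s in lexicon]
    candidates = sorted(scored, key=lambda t: t[0])[:shortlist]
    # Stage 2 (assumption: plain DTW instead of Active-DTW) on the short list only.
    reranked = sorted(candidates, key=lambda t: dtw(query, t[2]))
    return [label for _, label, _ in reranked]


# Toy lexicon of hypothetical 1-D "shapes".
lexicon = [
    ("ramp", [0, 1, 2, 3, 4]),
    ("flat", [2, 2, 2, 2]),
    ("vee", [4, 2, 0, 2, 4]),
]
query = [0, 0, 1, 2, 3, 4, 4]
print(search(query, lexicon))
```

The point of the design is that the expensive elastic comparison runs only on the short list, while the fixed-length vectors make the first stage a cheap Euclidean lookup that a spatial index such as a kd-tree can accelerate.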