• DocumentCode
    591951
  • Title

    Separability versus Prototypicality in Handwritten Word Retrieval

  • Author

    Van Oosten, Jean-Paul ; Schomaker, Lambert

  • Author_Institution
    Dept. of Artificial Intell., Univ. of Groningen, Groningen, Netherlands
  • fYear
    2012
  • fDate
    18-20 Sept. 2012
  • Firstpage
    8
  • Lastpage
    13
  • Abstract
    User appreciation of a word-image retrieval system is based on the quality of a hit list for a query. Using support vector machines for ranking in large scale, handwritten document collections, we observed that many hit lists suffered from bad instances in the top ranks. An analysis of this problem revealed that two functions needed to be optimised concerning both separability and prototypicality. By ranking images in two stages, the number of distracting images is reduced, making the method very convenient for massive scale, continuously trainable retrieval engines. Instead of cumbersome SVM training, we present a nearest-centroid method and show that precision improvements of up to 35 percentage points can be achieved, yielding up to 100% precision in data sets with a large amount of instances, while maintaining high recall performances.
  • Keywords
    handwritten character recognition; image retrieval; support vector machines; SVM training; handwritten document collection; handwritten word retrieval; image ranking; nearest-centroid method; support vector machine; word-image retrieval system; Accuracy; Engines; Humans; Prototypes; Shape; Support vector machines; Training; Handwriting recognition; Handwritten word retrieval; Prototypicality; Ranking; Separability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
  • Conference_Location
    Bari
  • Print_ISBN
    978-1-4673-2262-1
  • Type

    conf

  • DOI
    10.1109/ICFHR.2012.269
  • Filename
    6424363