• DocumentCode
    2197140
  • Title

    Adapting BLSTM Neural Network Based Keyword Spotting Trained on Modern Data to Historical Documents

  • Author

    Frinken, Volkmar ; Fischer, Andreas ; Bunke, Horst ; Manmatha, R.

  • Author_Institution
    Inst. of Comput. Sci. & Appl. Math., Univ. of Bern, Bern, Switzerland
  • fYear
    2010
  • fDate
    16-18 Nov. 2010
  • Firstpage
    352
  • Lastpage
    357
  • Abstract
    Being able to search for words or phrases in historic handwritten documents is of paramount importance when preserving cultural heritage. Storing scanned pages of written text can save the information from degradation, but it does not make the textual information readily available. Automatic keyword spotting systems for handwritten historic documents can fill this gap. However, most such systems have trouble with the great variety of writing styles. It is not uncommon for handwriting processing systems to be built for just a single book. In this paper we show that neural network based keyword spotting systems are flexible enough to be used successfully on historic data, even when they are trained on a modern handwriting database. We demonstrate that with little transcribed historic text, added to the training set, the performance can further be enhanced.
  • Keywords
    content-based retrieval; document image processing; handwriting recognition; neural nets; text analysis; BLSTM neural network; handwriting database; handwriting processing; handwritten historic document; keyword spotting system; Adaptation; Handwriting Recognition; Historical Data; Keyword Spotting; Neural Networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
  • Conference_Location
    Kolkata
  • Print_ISBN
    978-1-4244-8353-2
  • Type

    conf

  • DOI
    10.1109/ICFHR.2010.61
  • Filename
    5693548