Title :
Adapting BLSTM Neural Network Based Keyword Spotting Trained on Modern Data to Historical Documents
Author :
Frinken, Volkmar ; Fischer, Andreas ; Bunke, Horst ; Manmatha, R.
Author_Institution :
Inst. of Comput. Sci. & Appl. Math., Univ. of Bern, Bern, Switzerland
Abstract :
Being able to search for words or phrases in historic handwritten documents is of paramount importance when preserving cultural heritage. Storing scanned pages of written text can save the information from degradation, but it does not make the textual information readily available. Automatic keyword spotting systems for handwritten historic documents can fill this gap. However, most such systems have trouble with the great variety of writing styles. It is not uncommon for handwriting processing systems to be built for just a single book. In this paper we show that neural network based keyword spotting systems are flexible enough to be used successfully on historic data, even when they are trained on a modern handwriting database. We demonstrate that with little transcribed historic text, added to the training set, the performance can further be enhanced.
Keywords :
content-based retrieval; document image processing; handwriting recognition; neural nets; text analysis; BLSTM neural network; handwriting database; handwriting processing; handwritten historic document; keyword spotting system; Adaptation; Handwriting Recognition; Historical Data; Keyword Spotting; Neural Networks;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-8353-2
DOI :
10.1109/ICFHR.2010.61