DocumentCode
2197140
Title
Adapting BLSTM Neural Network Based Keyword Spotting Trained on Modern Data to Historical Documents
Author
Frinken, Volkmar ; Fischer, Andreas ; Bunke, Horst ; Manmatha, R.
Author_Institution
Inst. of Comput. Sci. & Appl. Math., Univ. of Bern, Bern, Switzerland
fYear
2010
fDate
16-18 Nov. 2010
Firstpage
352
Lastpage
357
Abstract
Being able to search for words or phrases in historic handwritten documents is of paramount importance when preserving cultural heritage. Storing scanned pages of written text can save the information from degradation, but it does not make the textual information readily available. Automatic keyword spotting systems for handwritten historic documents can fill this gap. However, most such systems have trouble with the great variety of writing styles. It is not uncommon for handwriting processing systems to be built for just a single book. In this paper we show that neural network based keyword spotting systems are flexible enough to be used successfully on historic data, even when they are trained on a modern handwriting database. We demonstrate that with little transcribed historic text, added to the training set, the performance can further be enhanced.
Keywords
content-based retrieval; document image processing; handwriting recognition; neural nets; text analysis; BLSTM neural network; handwriting database; handwriting processing; handwritten historic document; keyword spotting system; Adaptation; Handwriting Recognition; Historical Data; Keyword Spotting; Neural Networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location
Kolkata
Print_ISBN
978-1-4244-8353-2
Type
conf
DOI
10.1109/ICFHR.2010.61
Filename
5693548
Link To Document