DocumentCode :
178115
Title :
Bayesian Active Learning for Keyword Spotting in Handwritten Documents
Author :
Kumar, G. ; Govindaraju, V.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. at Buffalo, Amherst, NY, USA
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
2041
Lastpage :
2046
Abstract :
We propose the Bayesian Active Learning by Disagreement (BALD) model for keyword spotting in handwritten documents. In the context of keyword spotting in handwritten documents, the background text is all regions in the document that do not contain the keywords. The model tries to learn certain characteristics of the keyword and background text in an active learning framework. It takes into account the local character level scores and global word level scores to distinguish keywords from non-keywords. We propose to apply the bayesian active learning strategy to identify the regions of sample space from which more meaningful labeled samples of keywords and non-keywords can be extracted. This work is an extension to our previous work which used a variational dynamic background model to model the large variations of background text. The approach has been tested on IAM dataset for English. The results show that a decent background model can be learned in a more quicker and efficient manner using the BALD framework. The approach outperforms our prior work and other state of the art approaches.
Keywords :
Bayes methods; document image processing; handwriting recognition; learning (artificial intelligence); text analysis; BALD model; Bayesian active learning by disagreement model; IAM dataset; background text; global word level scores; handwritten documents; keyword spotting; local character level scores; variational dynamic background model; Bayes methods; Entropy; Feature extraction; Hidden Markov models; Image segmentation; Logistics; Mathematical model; Bayesian Active Learning; Handwriting Recognition; Spotting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.356
Filename :
6977068
Link To Document :
بازگشت