DocumentCode
2145009
Title
Discriminative Bernoulli Mixture Models for Handwritten Digit Recognition
Author
Giménez, Adrià ; Andrés-Ferrer, J. ; Juan, Alfons ; Serrano, Nicolás
Author_Institution
DSIC, Univ. Politec. de Valencia, Valencia, Spain
fYear
2011
fDate
18-21 Sept. 2011
Firstpage
558
Lastpage
562
Abstract
Bernoulli-based models such as Bernoulli mixtures or Bernoulli HMMs (BHMMs), have been successfully applied to several handwritten text recognition (HTR) tasks which range from character recognition to continuous and isolated handwritten words. All these models belong to the generative model family and, hence, are usually trained by (joint) maximum likelihood estimation (MLE). Despite the good properties of the MLE criterion, there are better training criteria such as maximum mutual information (MMI). The MMI is a widespread criterion that is mainly employed to train discriminative models such as log-linear (or maximum entropy) models. Inspired by the Bernoulli mixture classifier, in this work a log-linear model for binary data is proposed, the so-called mixture of multi-class logistic regression. The proposed model is proved to be equivalent to the Bernoulli mixture classifier. In this way, we give a discriminative training framework for Bernoulli mixture models. The proposed discriminative training framework is applied to a well-known Indian digit recognition task.
Keywords
entropy; handwritten character recognition; learning (artificial intelligence); maximum likelihood decoding; maximum likelihood estimation; natural language processing; regression analysis; text analysis; Bernoulli HMMs; Bernoulli mixture classifier; Indian digit recognition task; binary data; character recognition; continuous handwritten words; discriminative Bernoulli mixture models; discriminative training framework; handwritten digit recognition; handwritten text recognition; isolated handwritten words; log-linear model; maximum entropy model; maximum likelihood estimation; maximum mutual information; multiclass logistic regression; Data models; Handwriting recognition; Hidden Markov models; Joints; Logistics; Maximum likelihood estimation; Training; Bernoulli mixture; MMI; discriminative training; log-linear models; mixture of multi-class logistic regression;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location
Beijing
ISSN
1520-5363
Print_ISBN
978-1-4577-1350-7
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2011.118
Filename
6065373
Link To Document