• DocumentCode
    2145009
  • Title

    Discriminative Bernoulli Mixture Models for Handwritten Digit Recognition

  • Author

    Giménez, Adrià ; Andrés-Ferrer, J. ; Juan, Alfons ; Serrano, Nicolás

  • Author_Institution
    DSIC, Univ. Politec. de Valencia, Valencia, Spain
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    558
  • Lastpage
    562
  • Abstract
    Bernoulli-based models such as Bernoulli mixtures or Bernoulli HMMs (BHMMs), have been successfully applied to several handwritten text recognition (HTR) tasks which range from character recognition to continuous and isolated handwritten words. All these models belong to the generative model family and, hence, are usually trained by (joint) maximum likelihood estimation (MLE). Despite the good properties of the MLE criterion, there are better training criteria such as maximum mutual information (MMI). The MMI is a widespread criterion that is mainly employed to train discriminative models such as log-linear (or maximum entropy) models. Inspired by the Bernoulli mixture classifier, in this work a log-linear model for binary data is proposed, the so-called mixture of multi-class logistic regression. The proposed model is proved to be equivalent to the Bernoulli mixture classifier. In this way, we give a discriminative training framework for Bernoulli mixture models. The proposed discriminative training framework is applied to a well-known Indian digit recognition task.
  • Keywords
    entropy; handwritten character recognition; learning (artificial intelligence); maximum likelihood decoding; maximum likelihood estimation; natural language processing; regression analysis; text analysis; Bernoulli HMMs; Bernoulli mixture classifier; Indian digit recognition task; binary data; character recognition; continuous handwritten words; discriminative Bernoulli mixture models; discriminative training framework; handwritten digit recognition; handwritten text recognition; isolated handwritten words; log-linear model; maximum entropy model; maximum likelihood estimation; maximum mutual information; multiclass logistic regression; Data models; Handwriting recognition; Hidden Markov models; Joints; Logistics; Maximum likelihood estimation; Training; Bernoulli mixture; MMI; discriminative training; log-linear models; mixture of multi-class logistic regression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.118
  • Filename
    6065373