• DocumentCode
    2489472
  • Title

    A discriminative semi-Markov model for robust scene text recognition

  • Author

    Weinman, Jerod J. ; Learned-Miller, Erik ; Hanson, Allen

  • Author_Institution
    Dept. of Comput. Sci., Grinnell Coll., IA
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    We present a semi-Markov model for recognizing scene text that integrates character and word segmentation with recognition. Using wavelet features, it requires only approximate location of the text baseline and font size; no binarization or prior word segmentation is necessary. Our system is aided by a lexicon, yet it also allows non-lexicon words. To facilitate inference with a large lexicon, we use an approximate Viterbi beam search. Our system performs robustly on low-resolution images of signs containing text in fonts atypical of documents.
  • Keywords
    Markov processes; character recognition; document handling; feature extraction; image segmentation; maximum likelihood estimation; text analysis; wavelet transforms; approximate Viterbi beam search; character segmentation; discriminative semiMarkov model; nonlexicon words; robust scene text recognition; scene text recognition; wavelet features; word segmentation; Character recognition; Computer science; Educational institutions; Handwriting recognition; Image segmentation; Layout; Noise robustness; Text recognition; Typesetting; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761818
  • Filename
    4761818