• DocumentCode
    382286
  • Title

    Robust video text segmentation and recognition with multiple hypotheses

  • Author

    Dobez, J.-M. ; Chen, Datong

  • Author_Institution
    IDIAP, Switzerland
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Abstract
    A method for segmenting and recognizing text embedded in video and images is proposed. Multiple segmentation of the same text region is performed, thus producing multiple hypotheses of binary text images. The segmentation algorithm is stated as a statistical labeling and is based on a Markov random field (MRF) model of the label map. Background regions in each hypothesis are then removed by performing a connected component analysis and by enforcing a more stringent constraint (called GCC - grayscale consistency constraint) on the text characters´ grayscale values using a robust 1D-median operator. Each text image hypothesis is then processed by optical character recognition (OCR) software. The final result is then selected from the set of output strings. Results show that both the use of multiple hypotheses and the GCC significantly improve the results.
  • Keywords
    Markov processes; feature extraction; image segmentation; optical character recognition; text analysis; video signal processing; 1D-median operator; Markov random field; binary text images; connected component analysis; grayscale consistency constraint; image segmentation; multiple hypotheses; optical character recognition; statistical labeling; video segmentation; video text segmentation; Character recognition; Gray-scale; Image recognition; Image segmentation; Labeling; Markov random fields; Optical character recognition software; Performance analysis; Robustness; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing. 2002. Proceedings. 2002 International Conference on
  • ISSN
    1522-4880
  • Print_ISBN
    0-7803-7622-6
  • Type

    conf

  • DOI
    10.1109/ICIP.2002.1039980
  • Filename
    1039980