• DocumentCode
    2528521
  • Title

    Segmentation of Document Images Using Higher Order Statistics

  • Author

    Borges, Paulo Vinicius Koerich ; Mayer, Joceli ; Izquierdo, Ebroul

  • Author_Institution
    Queen Mary Univ. of London, London
  • fYear
    2007
  • fDate
    1-3 Oct. 2007
  • Firstpage
    296
  • Lastpage
    299
  • Abstract
    This work presents an efficient post-segmentation method for separating text from the background in document images. For this task, this paper proposes the use of textured patterns to represent text in documents, instead of the standard black. It is shown that, in poor quality documents, text segmentation is more efficient when the characters in the document are represented in a halftoned gray level prior to printing. This occurs because the halftoning process induces statistical characteristics that help the text to be distinguished from noise or background. A typical case are noisy printed and scanned documents. Experiments validate the analysis and the applicability of the segmentation method. An important application for the method is in the postal service, where letters have their addresses segmented for automatic sorting.
  • Keywords
    document image processing; higher order statistics; image segmentation; automatic sorting; document images segmentation; halftoning process; higher order statistics; post-segmentation method; postal service; statistical characteristics; text segmentation; Background noise; Higher order statistics; Image analysis; Image classification; Image processing; Image segmentation; Image texture analysis; Postal services; Printing; Sorting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing, 2007. MMSP 2007. IEEE 9th Workshop on
  • Conference_Location
    Crete
  • Print_ISBN
    978-1-4244-1274-7
  • Type

    conf

  • DOI
    10.1109/MMSP.2007.4412876
  • Filename
    4412876