DocumentCode
2528521
Title
Segmentation of Document Images Using Higher Order Statistics
Author
Borges, Paulo Vinicius Koerich ; Mayer, Joceli ; Izquierdo, Ebroul
Author_Institution
Queen Mary Univ. of London, London
fYear
2007
fDate
1-3 Oct. 2007
Firstpage
296
Lastpage
299
Abstract
This work presents an efficient post-segmentation method for separating text from the background in document images. For this task, this paper proposes the use of textured patterns to represent text in documents, instead of the standard black. It is shown that, in poor quality documents, text segmentation is more efficient when the characters in the document are represented in a halftoned gray level prior to printing. This occurs because the halftoning process induces statistical characteristics that help the text to be distinguished from noise or background. A typical case are noisy printed and scanned documents. Experiments validate the analysis and the applicability of the segmentation method. An important application for the method is in the postal service, where letters have their addresses segmented for automatic sorting.
Keywords
document image processing; higher order statistics; image segmentation; automatic sorting; document images segmentation; halftoning process; higher order statistics; post-segmentation method; postal service; statistical characteristics; text segmentation; Background noise; Higher order statistics; Image analysis; Image classification; Image processing; Image segmentation; Image texture analysis; Postal services; Printing; Sorting;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 2007. MMSP 2007. IEEE 9th Workshop on
Conference_Location
Crete
Print_ISBN
978-1-4244-1274-7
Type
conf
DOI
10.1109/MMSP.2007.4412876
Filename
4412876
Link To Document