DocumentCode :
2528521
Title :
Segmentation of Document Images Using Higher Order Statistics
Author :
Borges, Paulo Vinicius Koerich ; Mayer, Joceli ; Izquierdo, Ebroul
Author_Institution :
Queen Mary Univ. of London, London
fYear :
2007
fDate :
1-3 Oct. 2007
Firstpage :
296
Lastpage :
299
Abstract :
This work presents an efficient post-segmentation method for separating text from the background in document images. For this task, this paper proposes the use of textured patterns to represent text in documents, instead of the standard black. It is shown that, in poor quality documents, text segmentation is more efficient when the characters in the document are represented in a halftoned gray level prior to printing. This occurs because the halftoning process induces statistical characteristics that help the text to be distinguished from noise or background. A typical case are noisy printed and scanned documents. Experiments validate the analysis and the applicability of the segmentation method. An important application for the method is in the postal service, where letters have their addresses segmented for automatic sorting.
Keywords :
document image processing; higher order statistics; image segmentation; automatic sorting; document images segmentation; halftoning process; higher order statistics; post-segmentation method; postal service; statistical characteristics; text segmentation; Background noise; Higher order statistics; Image analysis; Image classification; Image processing; Image segmentation; Image texture analysis; Postal services; Printing; Sorting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 2007. MMSP 2007. IEEE 9th Workshop on
Conference_Location :
Crete
Print_ISBN :
978-1-4244-1274-7
Type :
conf
DOI :
10.1109/MMSP.2007.4412876
Filename :
4412876
Link To Document :
بازگشت