Title :
Text Segmentation in Colour Posters from the Spanish Civil War Era
Author :
Clavelli, Antonio ; Karatzas, Dimosthenis
Author_Institution :
Comput. Vision Centre, UAB, Barcelona, Spain
Abstract :
The extraction of textual content from colour documents of a graphical nature is a complicated task. The text can be rendered in any colour, size and orientation, while complex background graphics with repetitive patterns can make its localization and segmentation extremely difficult. Here, we propose a new method for extracting textual content from such colour images that makes no assumptions about the size, orientation or colour of the characters, and that tolerates characters that do not follow a straight baseline. We evaluate this method on a collection of documents with historical connotations: posters from the Spanish Civil War.
Keywords :
document image processing; feature extraction; image colour analysis; image segmentation; image texture; rendering (computer graphics); text analysis; Spanish Civil War Era; colour poster; document collection; rendering; text segmentation; textual content extraction; Colored noise; Computer graphics; Computer vision; Focusing; Image color analysis; Image segmentation; Image sequence analysis; Rendering (computer graphics); Text analysis; Text recognition; colour DIA; historical documents; text extraction
Conference_Title :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
ISSN :
1520-5363
DOI :
10.1109/ICDAR.2009.32