Title :
Fast Seamless Skew and Orientation Detection in Document Images
Author :
Konya, Iuliu ; Eickeler, Stefan ; Seibert, Christoph
Author_Institution :
Fraunhofer Inst. for Intell. Anal. & Inf. Syst. (IAIS), St. Augustin, Germany
Abstract :
Reliable and generic methods for skew detection are a necessity for any large-scale digitization projects. As one of the first processing steps, skew detection and correction has a heavy influence on all further document analysis modules, such as geometric and logical layout analysis. This paper introduces a generic, scale-independent algorithm capable of accurately detecting the global skew angle of document images within the range [-90°, 90°]. By using the same framework, the algorithm is then extended for Roman script documents so as to cope with the full range [-180°, 180°) of possible skew angles. Despite its generality, the improved algorithm is very fast and requires no explicit parameters. Experiments on a combined test set comprising around 110000 real-life images show the accuracy and robustness of the proposed method.
Keywords :
document image processing; image recognition; Roman script documents; document analysis; document image orientation detection; document images; fast seamless document image skew detection; geometric analysis; large-scale digitization projects; logical layout analysis; Accuracy; Algorithm design and analysis; Histograms; Image edge detection; Layout; Robustness; Text analysis; document analysis; orientation detection; skew detection;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.474