Title :
Skew Estimation of Sparsely Inscribed Document Fragments
Author :
Diem, Markus ; Kleber, Florian ; Sablatnig, Robert
Author_Institution :
Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
Abstract :
Document analysis is done to analyze entire forms (e.g. intelligent form analysis, table detection) or to describe the layout/structure of a document for further processing. A pre-processing step of document analysis methods is a skew estimation of scanned or photographed documents. Current skew estimation methods require the existence of large text areas, are dependent on the text type and can be limited on a specific angle range. The proposed method is gradient based in combination with a Focused Nearest Neighbor Clustering of interest points and has no limitations regarding the detectable angle range. The upside/down decision is based on statistical analysis of ascenders and descenders. It can be applied to entire documents as well as to document fragments containing only a few words. Results show that the proposed skew estimation is comparable with state-of-the-art methods and outperforms them on a real dataset consisting of 658 snippets.
Keywords :
document image processing; estimation theory; optical character recognition; pattern clustering; statistical analysis; text analysis; OCR method; ascenders; descenders; document analysis method; document processing; focused nearest neighbor clustering; gradient based method; photographed document; scanned document; skew estimation method; sparsely inscribed document fragment; statistical analysis; upside-down decision; Algorithm design and analysis; Estimation; Histograms; Robustness; Text analysis; Transforms; Vectors; document fragments; rotation; skew estimation;
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Cost, QLD
Print_ISBN :
978-1-4673-0868-7
DOI :
10.1109/DAS.2012.81