DocumentCode
2011806
Title
Skew Estimation of Sparsely Inscribed Document Fragments
Author
Diem, Markus ; Kleber, Florian ; Sablatnig, Robert
Author_Institution
Comput. Vision Lab., Vienna Univ. of Technol., Vienna, Austria
fYear
2012
fDate
27-29 March 2012
Firstpage
292
Lastpage
296
Abstract
Document analysis is done to analyze entire forms (e.g. intelligent form analysis, table detection) or to describe the layout/structure of a document for further processing. A pre-processing step of document analysis methods is a skew estimation of scanned or photographed documents. Current skew estimation methods require the existence of large text areas, are dependent on the text type and can be limited on a specific angle range. The proposed method is gradient based in combination with a Focused Nearest Neighbor Clustering of interest points and has no limitations regarding the detectable angle range. The upside/down decision is based on statistical analysis of ascenders and descenders. It can be applied to entire documents as well as to document fragments containing only a few words. Results show that the proposed skew estimation is comparable with state-of-the-art methods and outperforms them on a real dataset consisting of 658 snippets.
Keywords
document image processing; estimation theory; optical character recognition; pattern clustering; statistical analysis; text analysis; OCR method; ascenders; descenders; document analysis method; document processing; focused nearest neighbor clustering; gradient based method; photographed document; scanned document; skew estimation method; sparsely inscribed document fragment; statistical analysis; upside-down decision; Algorithm design and analysis; Estimation; Histograms; Robustness; Text analysis; Transforms; Vectors; document fragments; rotation; skew estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location
Gold Cost, QLD
Print_ISBN
978-1-4673-0868-7
Type
conf
DOI
10.1109/DAS.2012.81
Filename
6195381
Link To Document