DocumentCode :
1247818
Title :
A parallel-line detection algorithm based on HMM decoding
Author :
Zheng, Yefeng ; Li, Huiping ; Doermann, David
Author_Institution :
Language & Media Process. Lab., Maryland Univ., College Park, MD, USA
Volume :
27
Issue :
5
fYear :
2005
fDate :
5/1/2005 12:00:00 AM
Firstpage :
777
Lastpage :
792
Abstract :
The detection of groups of parallel lines is important in applications such as form processing and text (handwriting) extraction from rule lined paper. These tasks can be very challenging in degraded documents where the lines are severely broken. In this paper, we propose a novel model-based method which incorporates high-level context to detect these lines. After preprocessing (such as skew correction and text filtering), we use trained hidden Markov models (HMM) to locate the optimal positions of all lines simultaneously on the horizontal or vertical projection profiles, based on the Viterbi decoding. The algorithm is trainable so it can be easily adapted to different application scenarios. The experiments conducted on known form processing and rule line detection show our method is robust, and achieves better results than other widely used line detection methods.
Keywords :
Viterbi decoding; document image processing; feature extraction; hidden Markov models; Viterbi decoding; document image analysis; form processing; hidden Markov models; model-based method; parallel-line detection algorithm; rule line detection; text extraction; Context modeling; Decoding; Degradation; Detection algorithms; Filtering; Hidden Markov models; Image quality; Optical character recognition software; Text analysis; Viterbi algorithm; Index Terms- Line detection; document image analysis.; form identification; form processing; form registration; hidden Markov model; Algorithms; Artificial Intelligence; Automatic Data Processing; Computer Graphics; Documentation; Handwriting; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Markov Chains; Models, Statistical; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2005.89
Filename :
1407880
Link To Document :
بازگشت