DocumentCode :
1492890
Title :
Skew angle detection of digitized Indian script documents
Author :
Chaudhuri, B.B. ; Pal, U.
Author_Institution :
Comput. Vision & Pattern Recognition, Indian Stat. Inst., Calcutta, India
Volume :
19
Issue :
2
fYear :
1997
fDate :
2/1/1997
Firstpage :
182
Lastpage :
186
Abstract :
Skew angle detection of scanned documents containing the most popular Indian scripts (Devnagari and Bangla) is considered. Most characters in these scripts have a horizontal line at the top, called the head line. The character head lines mostly join one another within a word, so the word appears as a single connected component. In the proposed method, the components are first labeled. The upper envelope of each component is found by columnwise scanning from an imaginary line above the component. Portions of the upper envelope that satisfy the properties of a digital straight line are detected and then clustered as belonging to single text lines. Estimates from the individual clusters are combined to obtain the skew angle. Apart from accuracy and efficiency, an advantage of the method is that character segmentation and zone detection can be readily done from the head line information, which is useful in optical character recognition approaches for these scripts.
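The envelope-then-estimate idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, each component is assumed to arrive as a binary mask (a list of rows of booleans) that has already been labeled, and an ordinary least-squares line fit stands in for the digital-straight-line detection and text-line clustering that the paper describes. The per-component estimates are combined with a median, which is also an assumption.

```python
import math
from statistics import median


def upper_envelope(mask):
    """Scan each column from the top and return (column, row) of the first ink pixel."""
    points = []
    n_rows = len(mask)
    n_cols = len(mask[0]) if n_rows else 0
    for col in range(n_cols):
        for row in range(n_rows):
            if mask[row][col]:
                points.append((col, row))
                break  # only the topmost ink pixel of this column is kept
    return points


def fit_slope(points):
    """Least-squares slope of row against column (stand-in for DSL detection)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx


def component_skew_degrees(mask):
    """Skew estimate for one component, or None if the envelope is too short."""
    points = upper_envelope(mask)
    if len(points) < 2:
        return None
    # Image rows grow downward, so the slope is negated to make
    # counter-clockwise rotation positive.
    return math.degrees(math.atan(-fit_slope(points)))


def document_skew_degrees(masks):
    """Combine per-component estimates; the median is an assumed combiner."""
    angles = [a for a in (component_skew_degrees(m) for m in masks) if a is not None]
    return median(angles) if angles else None
```

A plain least-squares fit is sensitive to strokes that protrude above the head line, which is presumably one reason the paper restricts the estimate to envelope portions that behave like digital straight lines; a closer reproduction would filter the envelope before fitting.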
Keywords :
image segmentation; optical character recognition; Bangla; Devnagari; OCR; character head lines; character segmentation; clusters; columnwise scanning; digitized Indian script documents; scanned documents; skew angle detection; upper envelope; zone detection; Character recognition; Envelope detectors; Magnetic heads; Natural languages; Optical character recognition software; Optical detectors; Optical sensors; Software libraries; Strips; Text analysis
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
IEEE
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.574803
Filename :
574803
Link To Document :