DocumentCode
2475975
Title
Lexicon-based offline recognition of Amharic words in unconstrained handwritten text
Author
Assabie, Yaregal ; Bigun, Josef
Author_Institution
Sch. of Inf. Sci., Halmstad Univ., Halmstad, Sweden
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
This paper describes an offline handwriting recognition system for Amharic words based on lexicon. The system computes direction fields of scanned handwritten documents, from which pseudo-characters are segmented. The pseudo-characters are organized based on their proximity and direction to form text lines. Words are then segmented by analyzing the relative gap between subsequent pseudo-characters in text lines. For each segmented word image, the structural characteristics of pseudo-characters are syntactically analyzed to predict a set of plausible characters forming the word. The most likelihood word is finally selected among candidates by matching against the lexicon. The system is tested by a database of unconstrained handwritten Amharic documents collected from various sources. The lexicon is prepared from words appearing in the collected database.
Keywords
document image processing; handwriting recognition; handwritten character recognition; image matching; image segmentation; natural language processing; word processing; Amharic words; lexicon-based offline recognition; offline handwriting recognition system; pseudo-characters segmentation; scanned handwritten Amharic documents; unconstrained handwritten text; word image segmention; Character generation; Character recognition; Handwriting recognition; Hidden Markov models; Image analysis; Image segmentation; Information science; Natural languages; Pixel; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761145
Filename
4761145
Link To Document