• DocumentCode
    2475975
  • Title

    Lexicon-based offline recognition of Amharic words in unconstrained handwritten text

  • Author

    Assabie, Yaregal ; Bigun, Josef

  • Author_Institution
    Sch. of Inf. Sci., Halmstad Univ., Halmstad, Sweden
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper describes an offline handwriting recognition system for Amharic words based on lexicon. The system computes direction fields of scanned handwritten documents, from which pseudo-characters are segmented. The pseudo-characters are organized based on their proximity and direction to form text lines. Words are then segmented by analyzing the relative gap between subsequent pseudo-characters in text lines. For each segmented word image, the structural characteristics of pseudo-characters are syntactically analyzed to predict a set of plausible characters forming the word. The most likelihood word is finally selected among candidates by matching against the lexicon. The system is tested by a database of unconstrained handwritten Amharic documents collected from various sources. The lexicon is prepared from words appearing in the collected database.
  • Keywords
    document image processing; handwriting recognition; handwritten character recognition; image matching; image segmentation; natural language processing; word processing; Amharic words; lexicon-based offline recognition; offline handwriting recognition system; pseudo-characters segmentation; scanned handwritten Amharic documents; unconstrained handwritten text; word image segmention; Character generation; Character recognition; Handwriting recognition; Hidden Markov models; Image analysis; Image segmentation; Information science; Natural languages; Pixel; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761145
  • Filename
    4761145