• DocumentCode
    591962
  • Title

    Multi-lingual City Name Recognition for Indian Postal Automation

  • Author

    Pal, Umapada ; Roy, R.K. ; Kimura, Fumitaka

  • Author_Institution
    Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Kolkata, India
  • fYear
    2012
  • fDate
    18-20 Sept. 2012
  • Firstpage
    169
  • Lastpage
    173
  • Abstract
    Under three-language formula, the destination address block of postal document of an Indian state is generally written in three languages: English, Hindi and the State official language. From the statistical analysis we found that 12.37%, 76.32% and 10.21% postal documents are written in Bangla, English and Devanagari script, respectively. Because of inter-mixing of these scripts in postal address writings, it is very difficult to identify the script by which a city name is written. To avoid such script identification difficulties, in this paper we proposed a lexicon-driven method for multi-lingual (English, Hindi and Bangla) city name recognition for Indian postal automation. In the proposed scheme, at first, to take care of slanted handwriting of different individuals a slant correction technique is performed. Next, a water reservoir concept is applied to pre-segment the slant corrected city names into possible primitive components (characters or its parts). Pre-segmented components of a city name are then merged into possible characters to get the best city name using the lexicon information. In order to merge these primitive components into characters and to find optimum character segmentation, dynamic programming (DP) is applied using total likelihood of the characters of a city name as an objective function. We tested our system on 16132 Indian trilingual city names and 92.25% overall recognition accuracy was obtained.
  • Keywords
    dynamic programming; handwritten character recognition; image segmentation; postal services; statistical analysis; Devanagari script; English language; Hindi language; Indian postal automation; State official language; character segmentation; destination address block; dynamic programming; lexicon-driven method; multilingual city name recognition; objective function; postal address writing; postal document; recognition accuracy; script identification; slant correction technique; slant presegmentation; slanted handwriting; statistical analysis; three-language formula; water reservoir concept; Automation; Cavity resonators; Cities and towns; Dynamic programming; Feature extraction; Handwriting recognition; Image segmentation; Handwritten character recognition; Indian Script; Indian postal automation; city name recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
  • Conference_Location
    Bari
  • Print_ISBN
    978-1-4673-2262-1
  • Type

    conf

  • DOI
    10.1109/ICFHR.2012.238
  • Filename
    6424387