• DocumentCode
    2853595
  • Title

    Hybrid Chinese/English text identification in Web images

  • Author

    Jiaying He ; Shaofa Li

  • fYear
    2004
  • fDate
    18-20 Dec. 2004
  • Firstpage
    361
  • Lastpage
    364
  • Abstract
    This paper presents a novel algorithm for hybrid Chinese/English text location, segmentation, and Chinese character reconstruction in Web images. Because Web images have characteristics that distinguish them from conventional complex-background images, most text segmentation algorithms that perform well in other fields fail on Web image text. The proposed algorithm locates and segments hybrid text in Web images and retrieves complete Chinese characters using a novel character reconstruction algorithm. Experimental results show that the approach achieves a high text detection rate and fast processing speed on Web image text, and gives promising results in segmenting oriental text symbols such as Chinese, Japanese, and Korean characters.
  • Keywords
    Internet; character recognition; image reconstruction; image segmentation; text analysis; Chinese character reconstruction; Web image text; complex background image; hybrid Chinese-English text identification; text detection; text segmentation algorithm; Character recognition; Graphics; Helium; Image analysis; Image color analysis; Image reconstruction; Image retrieval; Image segmentation; Text analysis; World Wide Web;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image and Graphics (ICIG'04), Third International Conference on
  • Conference_Location
    Hong Kong, China
  • Print_ISBN
    0-7695-2244-0
  • Type

    conf

  • DOI
    10.1109/ICIG.2004.78
  • Filename
    1410459