• DocumentCode
    2147496
  • Title
    A Chinese Character Localization Method Based on Intergrating Structure and CC-Clustering for Advertising Images
  • Author
    Liu, Jie ; Zhang, Shuwu ; Li, Heping ; Liang, Wei
  • Author_Institution
    Inst. of Autom., Beijing, China
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    1044
  • Lastpage
    1048
  • Abstract
    This paper proposes a Chinese character localization method for text in advertising images. To handle text with gradient color, an edge-based color clustering method separates the color image into homogeneous color layers. To locate characters that vary in size and style and are arranged in irregular directions, the method integrates structure and CC-clustering, locating characters according to their reliable features. Finally, a noise removal method based on the stroke width histogram removes all non-character connected components, after which all characters are located. Experimental results show that the proposed method can effectively locate characters in advertising images.
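    The abstract gives no details of the stroke width histogram step, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each connected component is a small binary mask (a list of rows of 0/1), approximates stroke width by horizontal run lengths only, and keeps components whose dominant width is close to the most common one. The function names and the `tolerance` parameter are hypothetical.

```python
from collections import Counter


def run_lengths(mask):
    """Collect horizontal run lengths of foreground (1) pixels in a binary mask."""
    runs = []
    for row in mask:
        length = 0
        for px in row:
            if px:
                length += 1
            elif length:
                runs.append(length)
                length = 0
        if length:  # run that reaches the end of the row
            runs.append(length)
    return runs


def dominant_stroke_width(mask):
    """Return the mode of the run-length histogram, or None for an empty mask."""
    hist = Counter(run_lengths(mask))
    if not hist:
        return None
    # Break ties toward the smaller width so the result is deterministic.
    return min(hist, key=lambda w: (-hist[w], w))


def filter_components(components, tolerance=1):
    """Keep components whose dominant width is within `tolerance` of the most common one.

    Assumption: true characters in one text region share a similar stroke
    width, so outliers are treated as noise.
    """
    widths = [dominant_stroke_width(c) for c in components]
    valid = [w for w in widths if w is not None]
    if not valid:
        return []
    counts = Counter(valid)
    reference = min(counts, key=lambda w: (-counts[w], w))
    return [c for c, w in zip(components, widths)
            if w is not None and abs(w - reference) <= tolerance]
```

    A real implementation would more likely measure stroke width in several directions and work on the components produced by the color-layer separation. The horizontal-only proxy is used here just to keep the sketch short.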
  • Keywords
    gradient methods; image colour analysis; natural language processing; pattern clustering; text analysis; CC clustering; Chinese character localization method; advertising images; color clustering method; gradient color; intergrating structure; noise removal method; Advertising; Colored noise; Feature extraction; Image color analysis; Image edge detection; Merging; Reliability; character localization; color clustering; connected component analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type
    conf
  • DOI
    10.1109/ICDAR.2011.211
  • Filename
    6065469