• DocumentCode
    2349286
  • Title

    A cross-connected components-based layout analysis algorithm for Chinese business card

  • Author

    Huiying Zhu ; Yuexian Zou

  • Author_Institution
    Shenzhen Grad. Sch., Peking Univ., Beijing
  • fYear
    2008
  • fDate
    3-5 June 2008
  • Firstpage
    2530
  • Lastpage
    2534
  • Abstract
    In this paper, a cross-connected components-based layout analysis algorithm for Chinese business card (CBC) is presented. The major aim of our scheme is to extract important personal information such as the holder´s name, title, address, phone number, etc. from the CBC image. Though much work has been done on layout analysis at present, this task is still very difficult considering the characteristics of the CBCs, which is generally complex layout, mixed Chinese-English characters and diverse typesetting. Conventionally, detection of connected components (CCs) of CBCs is based on 4 or 8-connectivity, both of which have a problem of producing lots of small CC pieces, making it difficult to merge them into a line. To solve this problem, we proposed a novel cross-connected component extraction algorithm (CCCE) along with a Center-Height Component Mergence (CHCM) algorithm for CBC layout analysis. We implemented our method on a common personal computer. The experimental results show that our method can greatly reduce the computational complexity and exhibits high CBC layout analysis accuracy.
  • Keywords
    character recognition; image recognition; natural language processing; Chinese business card; Chinese-English characters; center-height component mergence; cross-connected components-based layout analysis; diverse typesetting; Algorithm design and analysis; Carbon capture and storage; Character recognition; Companies; Computational complexity; Computational efficiency; Data mining; Laboratories; Microcomputers; Typesetting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Electronics and Applications, 2008. ICIEA 2008. 3rd IEEE Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-1717-9
  • Electronic_ISBN
    978-1-4244-1718-6
  • Type

    conf

  • DOI
    10.1109/ICIEA.2008.4582975
  • Filename
    4582975