• DocumentCode
    307111
  • Title

    Text segmentation for automatic document processing

  • Author

    Mital, Dinesh P. ; Leng, Goh Wee

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Inst., Singapore
  • Volume
    2
  • fYear
    1996
  • fDate
    18-21 Nov 1996
  • Firstpage
    642
  • Abstract
    There is considerable interest in designing automatic systems that can scan a paper document and store it on electronic media for easier storage, manipulation and access. Most documents contain graphics and images in addition to text. The document image therefore has to be segmented to identify text and image regions, so that appropriate techniques can be applied to each kind of region. In this paper, we present a new image segmentation technique that automatically identifies the text and image regions in a given document image. The technique is based on a differential-processing text extraction concept and is capable of analysing complex document image layouts. The document image is processed using textural feature analysis. Results of the proposed method are presented on test images, which demonstrate the robustness of the technique.
  • Keywords
    document image processing; image segmentation; automatic document processing; differential-processing text extraction concept; graphics; image regions; image segmentation; text regions; text segmentation; Data mining; Design engineering; Graphics; Image analysis; Image converters; Image segmentation; Optical character recognition software; Paper technology; Storage automation; Technical drawing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Emerging Technologies and Factory Automation, 1996. ETFA '96. Proceedings., 1996 IEEE Conference on
  • Conference_Location
    Kauai, HI
  • Print_ISBN
    0-7803-3685-2
  • Type

    conf

  • DOI
    10.1109/ETFA.1996.573971
  • Filename
    573971