• DocumentCode
    1614129
  • Title

    Page Segmentation of Persian/Arabic Printed Text Using Ink Spread Effect

  • Author

    Shirali-Shahreza, Sajad ; Manzuri-Shalmani, M.T. ; Shirali-Shahreza, M. Hassan

  • Author_Institution
    Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran
  • fYear
    2006
  • Firstpage
    259
  • Lastpage
    262
  • Abstract
    Nowadays, OCR (optical character recognition) is widely used for converting written documents to digital documents. One of the OCR phases is page segmentation. In page segmentation, text regions must be found in input image. In addition, text parts like text columns must be separated. In this paper, a new method for segmenting Persian/Arabic printed text is proposed. This method is based on ink spread effect idea, a new idea that has particular features. Main features of Persian/Arabic scripts are considered in designing this method. This method is skew resistant and can segment text within frames and tables or regions with gray background
  • Keywords
    document image processing; optical character recognition; text analysis; Persian/Arabic printed text; document conversion; ink spread effect; optical character recognition; page segmentation; Character recognition; Design methodology; Image converters; Image processing; Image segmentation; Ink; Natural languages; Optical character recognition software; Optical computing; Pattern recognition; Image Processing; OCR; Page Segmentation; Pattern Recognition; Persian/Arabic Document;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE-ICASE, 2006. International Joint Conference
  • Conference_Location
    Busan
  • Print_ISBN
    89-950038-4-7
  • Electronic_ISBN
    89-950038-5-5
  • Type

    conf

  • DOI
    10.1109/SICE.2006.315618
  • Filename
    4108835