• DocumentCode
    595404
  • Title

    A learning framework for degraded document image binarization using Markov Random Field

  • Author

    Bolan Su ; Shijian Lu ; Chew Lim Tan

  • Author_Institution
    Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    3200
  • Lastpage
    3203
  • Abstract
    Document image binarization is an important preprocessing technique for document image analysis that segments the text from the document image backgrounds. Many techniques have been proposed and successfully applied in different applications, such as document image retrieval. However, these techniques may perform poorly on degraded document images. In this paper, we propose a learning framework that makes use of the Markov Random Field to improve the performance of the existing document image binarization methods for those degraded document images. Extensive experiments on the recent Document Image Bina-rization Contest datasets demonstrate that significant improvements of the existing binarization methods when applying our proposed framework.
  • Keywords
    Markov processes; document image processing; image retrieval; image segmentation; learning (artificial intelligence); random processes; text analysis; Markov random field; degraded document image binarization methods; document image analysis; document image backgrounds; document image binarization contest datasets; document image retrieval; learning framework; text segmentation; Equations; Image edge detection; Markov random fields; Mathematical model; Text analysis; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460845