• DocumentCode
    183384
  • Title

    Document Binarization Using Topological Clustering Guided Laplacian Energy Segmentation

  • Author

    Ayyalasomayajula, Kalyan Ram ; Brun, Anders

  • Author_Institution
    Dept. of Inf. Technol., Uppsala Univ., Uppsala, Sweden
  • fYear
    2014
  • fDate
    1-4 Sept. 2014
  • Firstpage
    523
  • Lastpage
    528
  • Abstract
    The current approach for text binarization proposes a clustering algorithm as a preprocessing stage to an energy-based segmentation method. It uses a clustering algorithm to obtain a coarse estimate of the background (BG) and foreground (FG) pixels. These estimates are used as a prior for the source and sink points of a graph cut implementation, which is used to efficiently find the minimum energy solution of an objective function to separate the BG and FG. The binary image thus obtained is used to refine the edge map that guides the graph cut algorithm. A final binary image is obtained by once again performing the graph cut guided by the refined edges on Laplacian of the image.
  • Keywords
    document image processing; graph theory; image classification; image segmentation; learning (artificial intelligence); pattern clustering; BG pixels; FG pixels; background pixels; binary image; classification; clustering algorithm; document binarization; edge map; foreground pixels; graph cut algorithm; guided Laplacian energy segmentation; machine learning; minimum energy solution; preprocessing stage; refined edges; text binarization; topological clustering; Bandwidth; Clustering algorithms; Image edge detection; Laplace equations; Manganese; Optics; Topology; Classification; Graph-theoretic methods; Image Processing; Machine Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
  • Conference_Location
    Heraklion
  • ISSN
    2167-6445
  • Print_ISBN
    978-1-4799-4335-7
  • Type

    conf

  • DOI
    10.1109/ICFHR.2014.94
  • Filename
    6981073