• DocumentCode
    1132589
  • Title

    Document image segmentation using wavelet scale-space features

  • Author

    Acharyya, Mausumi ; Kundu, Malay K.

  • Author_Institution
    Machine Intelligence Unit, Indian Stat. Inst., Calcutta, India
  • Volume
    12
  • Issue
    12
  • fYear
    2002
  • fDate
    12/1/2002 12:00:00 AM
  • Firstpage
    1117
  • Lastpage
    1127
  • Abstract
    An efficient and computationally fast method for segmenting text and graphics part of document images based on textural cues is presented. We assume that the graphics part have different textural properties than the nongraphics (text) part. The segmentation method uses the notion of multiscale wavelet analysis and statistical pattern recognition. We have used M-band wavelets which decompose an image into M×M bandpass channels. Various combinations of these channels represent the image at different scales and orientations in the frequency plane. The objective is to transform the edges between textures into detectable discontinuities and create the feature maps which give a measure of the local energy around each pixel at different scales. From these feature maps, a scale-space signature is derived, which is the vector of features at different scales taken at each single pixel in an image. We achieve segmentation by simple analysis of the scale-space signature with traditional k- means clustering. We do not assume any a priori information regarding the font size, scanning resolution, type of layout, etc. of the document in our segmentation scheme.
  • Keywords
    channel bank filters; document image processing; feature extraction; filtering theory; image segmentation; pattern clustering; pattern recognition; wavelet transforms; M-band wavelet filters; bandpass channels; detectable discontinuities; document image segmentation; feature maps; filter bank; graphics; image decomposition; k-means clustering; local energy; multiscale wavelet analysis; pixel; scale-space signature; statistical pattern recognition; textural cues; textural properties; wavelet scale-space features; Energy measurement; Frequency; Graphics; Image edge detection; Image segmentation; Image storage; Pattern analysis; Pattern recognition; Pixel; Wavelet analysis;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2002.806812
  • Filename
    1175448