• DocumentCode
    3489087
  • Title

    Near Convex Region Adjacency Graph and Approximate Neighborhood String Matching for Symbol Spotting in Graphical Documents

  • Author

    Dutta, Arin ; Llados, Josep ; Bunke, Horst ; Pal, Umapada

  • Author_Institution
    Comput. Vision Center, Univ. Autonoma de Barcelona, Barcelona, Spain
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    1078
  • Lastpage
    1082
  • Abstract
    This paper deals with a sub graph matching problem in Region Adjacency Graph (RAG) applied to symbol spotting in graphical documents. RAG is a very important, efficient and natural way of representing graphical information with a graph but this is limited to cases where the information is well defined with perfectly delineated regions. What if the information we are interested in is not confined within well defined regions? This paper addresses this particular problem and solves it by defining near convex grouping of oriented line segments which results in near convex regions. Pure convexity imposes hard constraints and can not handle all the cases efficiently. Hence to solve this problem we have defined a new type of convexity of regions, which allows convex regions to have concavity to some extend. We call this kind of regions Near Convex Regions (NCRs). These NCRs are then used to create the Near Convex Region Adjacency Graph (NCRAG) and with this representation we have formulated the problem of symbol spotting in graphical documents as a sub graph matching problem. For sub graph matching we have used the Approximate Edit Distance Algorithm (AEDA) on the neighborhood string, which starts working after finding a key node in the input or target graph and iteratively identifies similar nodes of the query graph in the neighborhood of the key node. The experiments are performed on artificial, real and distorted datasets.
  • Keywords
    document handling; graph theory; string matching; AEDA; NCRAG; RAG; approximate edit distance algorithm; approximate neighborhood string matching; artificial datasets; distorted datasets; graphical documents; near convex region adjacency graph; query graph; real datasets; subgraph matching problem; symbol spotting; Approximation algorithms; Computer vision; Image segmentation; Robustness; Text analysis; Tin; Approximate Edit Distance Algorithm; Graphics Recognition; Near Convex Region Adjacency Graph; Subgraph Matching; Symbol Spotting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.215
  • Filename
    6628780