• DocumentCode
    2124708
  • Title

    Fuzzy Information Extraction on OCR Text

  • Author

    Pereda, Ray ; Taghva, Kazem

  • Author_Institution
    Inf. Sci. Res. Inst., Univ. of Nevada, Las Vegas, Las Vegas, NV, USA
  • fYear
    2011
  • fDate
    11-13 April 2011
  • Firstpage
    543
  • Lastpage
    546
  • Abstract
    In this paper, we report on two experiments on identification and extraction of Date of Birth instances. The objective of these experiments is to increase the recall level by increasing the edit distance while obtaining a reasonable precision.
  • Keywords
    fuzzy set theory; information retrieval; optical character recognition; text editing; OCR text; date of birth identification; edit distance; fuzzy information extraction; optical character recognition; Bills of materials; Data mining; HTML; Optical character recognition software; Pattern matching; Testing; Training data; OCR; duality; information extraction; patterns; relations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: New Generations (ITNG), 2011 Eighth International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-61284-427-5
  • Electronic_ISBN
    978-0-7695-4367-3
  • Type

    conf

  • DOI
    10.1109/ITNG.2011.99
  • Filename
    5945294