DocumentCode
2124708
Title
Fuzzy Information Extraction on OCR Text
Author
Pereda, Ray ; Taghva, Kazem
Author_Institution
Inf. Sci. Res. Inst., Univ. of Nevada, Las Vegas, Las Vegas, NV, USA
fYear
2011
fDate
11-13 April 2011
Firstpage
543
Lastpage
546
Abstract
In this paper, we report on two experiments on identification and extraction of Date of Birth instances. The objective of these experiments is to increase the recall level by increasing the edit distance while obtaining a reasonable precision.
Keywords
fuzzy set theory; information retrieval; optical character recognition; text editing; OCR text; date of birth identification; edit distance; fuzzy information extraction; optical character recognition; Bills of materials; Data mining; HTML; Optical character recognition software; Pattern matching; Testing; Training data; OCR; duality; information extraction; patterns; relations;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: New Generations (ITNG), 2011 Eighth International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-61284-427-5
Electronic_ISBN
978-0-7695-4367-3
Type
conf
DOI
10.1109/ITNG.2011.99
Filename
5945294
Link To Document