DocumentCode :
2124708
Title :
Fuzzy Information Extraction on OCR Text
Author :
Pereda, Ray ; Taghva, Kazem
Author_Institution :
Inf. Sci. Res. Inst., Univ. of Nevada, Las Vegas, Las Vegas, NV, USA
fYear :
2011
fDate :
11-13 April 2011
Firstpage :
543
Lastpage :
546
Abstract :
In this paper, we report on two experiments on identification and extraction of Date of Birth instances. The objective of these experiments is to increase the recall level by increasing the edit distance while obtaining a reasonable precision.
Keywords :
fuzzy set theory; information retrieval; optical character recognition; text editing; OCR text; date of birth identification; edit distance; fuzzy information extraction; optical character recognition; Bills of materials; Data mining; HTML; Optical character recognition software; Pattern matching; Testing; Training data; OCR; duality; information extraction; patterns; relations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: New Generations (ITNG), 2011 Eighth International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-61284-427-5
Electronic_ISBN :
978-0-7695-4367-3
Type :
conf
DOI :
10.1109/ITNG.2011.99
Filename :
5945294
Link To Document :
بازگشت