Title :
Development of Template-Free Form Recognition System
Author :
Hirayama, Junichi ; Shinjo, Hiroshi ; Takahashi, Toshikazu ; Nagasaki, Takeshi
Author_Institution :
Central Res. Lab., Hitachi, Ltd., Tokyo, Japan
Abstract :
We present a new form recognition technique. In our work, we were especially interested in developing a 'template-free' form recognition technique that extracts and recognizes target characters without pre-defined layout knowledge (a form template). We also attempted to overcome well-known difficulties in developing template-free form recognition techniques, namely extracting items from noisy form images and from forms with ambiguous alignment layouts. Using a hypothesis testing approach, we were able to extract items successfully from such form images.
Keywords :
character recognition; document image processing; feature extraction; ambiguous alignment layout form; character extraction; hypothesis testing approach; noisy form image; template-free form recognition system; Dictionaries; Layout; Noise measurement; Optical character recognition software; Portable document format; Target recognition; Document Layout Analysis; Form Recognition; Meta Extraction
Conference_Title :
2011 International Conference on Document Analysis and Recognition (ICDAR)
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
ISSN :
1520-5363
DOI :
10.1109/ICDAR.2011.56