Title :
Computerising natural history card archives
Author :
Downton, A.C. ; Lucas, S.M. ; Patoulas, G. ; Beccaloni, G.W. ; Scoble, M.J. ; Robinson, G.S.
Author_Institution :
Dept. of Electron. Syst. Eng., Essex Univ., UK
Abstract :
This paper summarises the achievements of a multidisciplinary bioinformatics project whose objective is to provide a general mechanism for efficiently computerising typewritten and hand-annotated archive card indexes of the type found in most museums, archives and libraries. In addition to efficiently scanning the cards, recognising their content and storing it in a database, the system must retain the original card images as the ultimate source record, and it requires a flexible database structure that allows taxonomists to reorganise and update the resulting online archive. Implementation mechanisms for each part of the overall system are described, and conversion performance is reported for a demonstrator database of 27,578 Pyralid moth archive cards. The system is currently being used to convert the full NHM archive of Lepidoptera, totalling 290,886 cards.
Keywords :
database indexing; document image processing; handwritten character recognition; library automation; text analysis; Lepidoptera cards; NHM archive; Pyralid moth archive cards; automation; content databasing; content recognition; content scanning; conversion performance; demonstrator database; efficient computerisation; flexible database structure; hand-annotated archive card indexes; libraries; multidisciplinary Bioinformatics project; museums; natural history card archives; online archive; original card images; source record; taxonomists; typewritten archive card indexes; Bioinformatics; Computer science; History; Image converters; Image databases; Image recognition; Libraries; Organisms; Systems engineering and theory; Text analysis
Conference_Title :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
DOI :
10.1109/ICDAR.2003.1227688