DocumentCode
1993934
Title
Computerising natural history card archives
Author
Downton, A.C. ; Lucas, S.M. ; Patoulas, G. ; Beccaloni, G.W. ; Scoble, M.J. ; Robinson, G.S.
Author_Institution
Dept. of Electron. Syst. Eng., Essex Univ., UK
fYear
2003
fDate
3-6 Aug. 2003
Firstpage
354
Abstract
This paper summarises the achievements of a multidisciplinary Bioinformatics project whose objective is to provide a general mechanism for efficient computerisation of typewritten/hand-annotated archive card indexes of the type found in most museums, archives and libraries. In addition to efficiently scanning, recognising and databasing the content of the cards, the system must maintain the original card images as the ultimate source record, and it requires a flexible database structure that allows taxonomists to reorganise and update the resulting online archive. Implementation mechanisms for each part of the overall system are described, and conversion performance for a demonstrator database of 27,578 Pyralid moth archive cards is reported. The system is currently being used to convert the full NHM archive of Lepidoptera, totalling 290,886 cards.
Keywords
database indexing; document image processing; handwritten character recognition; library automation; text analysis; Lepidoptera cards; NHM archive; Pyralid moth archive cards; automation; content databasing; content recognition; content scanning; conversion performance; demonstrator database; efficient computerisation; flexible database structure; hand-annotated archive card indexes; libraries; multidisciplinary Bioinformatics project; museums; natural history card archives; online archive; original card images; source record; taxonomists; typewritten archive card indexes; Bioinformatics; Computer science; History; Image converters; Image databases; Image recognition; Libraries; Organisms; Systems engineering and theory; Text analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN
0-7695-1960-1
Type
conf
DOI
10.1109/ICDAR.2003.1227688
Filename
1227688