• DocumentCode
    1993934
  • Title

    Computerising natural history card archives

  • Author

    Downton, A.C. ; Lucas, S.M. ; Patoulas, G. ; Beccaloni, G.W. ; Scoble, M.J. ; Robinson, G.S.

  • Author_Institution
    Dept. of Electron. Syst. Eng., Essex Univ., UK
  • fYear
    2003
  • fDate
    3-6 Aug. 2003
  • Firstpage
    354
  • Abstract
    This paper summarises the achievements of a multidisciplinary Bioinformatics project which has the objective of providing a general mechanism for efficient computerisation of typewritten/hand-annotated archive card indexes, of the type found in most museums, archives and libraries. In addition to efficiently scanning, recognising and databasing the content of the cards, the original card images must be maintained as the ultimate source record, and a flexible database structure is required to allow taxonomists to reorganise and update the resulting online archive. Implementation mechanisms for each part of the overall system are described, and conversion performance for a demonstrator database of 27,578 Pyralid moth archive cards is reported. The system is currently being used to convert the full NHM archive of Lepidoptera totalling 290,886 cards.
  • Keywords
    database indexing; document image processing; handwritten character recognition; library automation; text analysis; Lepidoptera cards; NHM archive; Pyralid moth archive cards; automation; content databasing; content recognition; content scanning; conversion performance; demonstrator database; efficient computerisation; flexible database structure; hand-annotated archive card indexes; libraries; multidisciplinary Bioinformatics project; museums; natural history card archives; online archive; original card images; source record; taxonomists; typewritten archive card indexes; Bioinformatics; Computer science; History; Image converters; Image databases; Image recognition; Libraries; Organisms; Systems engineering and theory; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-1960-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2003.1227688
  • Filename
    1227688