• DocumentCode
    2220063
  • Title

    Handwritten document retrieval

  • Author

    Russell, Gregory ; Perrone, Michael P. ; Yi-min Chee ; Ziq, Aiman

  • Author_Institution
    Pen Technol. Group, IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
  • fYear
    2002
  • fDate
    6-8 Aug. 2002
  • Firstpage
    233
  • Lastpage
    238
  • Abstract
    This paper investigates the use of both typed and handwritten queries to retrieve handwritten documents. The recognition-based approach reported here is novel in that it expands documents in a fashion analogous to query expansion: Individual documents are expanded using N-best lists which embody additional statistical information from a hidden Markov model (HMM) based handwriting recognizer used to transcribe each of the handwritten documents. This additional information enables the retrieval methods to be robust to machine transcription errors, retrieving documents which otherwise would be unretrievable. Cross-writer experiments on a database of 10985 words in 108 documents from 108 writers, and within-writer experiments in a probabilistic framework, on a database of 537724 words in 3342 documents from 43 writers, indicate that significant improvements in retrieval performance can be achieved. The second database is the largest database of on-line handwritten documents known to its.
  • Keywords
    handwritten character recognition; hidden Markov models; image retrieval; HMM; N-best lists; cross-writer experiments; handwritten document retrieval; handwritten queries; hidden Markov model; machine transcription error robustness; typed queries; Character recognition; Databases; Degradation; Handwriting recognition; Hidden Markov models; Information retrieval; Ink; Optical character recognition software; Redundancy; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition, 2002. Proceedings. Eighth International Workshop on
  • Conference_Location
    Niagara on the Lake, Ontario, Canada
  • Print_ISBN
    0-7695-1692-0
  • Type

    conf

  • DOI
    10.1109/IWFHR.2002.1030915
  • Filename
    1030915