• DocumentCode
    177854
  • Title

    Confusion Network Based Recurrent Neural Network Language Modeling for Chinese OCR Error Detection

  • Author

    Jinying Chen ; Yue Wu ; Huaigu Cao ; Natarajan, P.

  • Author_Institution
    Dept. of Speech, Language & Multimedia, Raytheon BBN Technol., Cambridge, MA, USA
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    1266
  • Lastpage
    1271
  • Abstract
    This paper presents a new framework for OCR error detection, which uses a conditional random field model to combine rich features from multiple sources, including confusion networks (c-nets), lexical local context and recurrent neural network language model (RNNLM). We propose a novel, efficient method for computing character-level c-net based RNNLM scores by using dynamic programming and c-net partial unfolding. Our experiments show that our error detection model has consistent observable improvements over a high baseline employed by our current OCR demo system, as measured by average precision and detection error trade-off curve on two test sets of Chinese image documents. Both linguistic and recognition features contribute to the high performance, with the former especially informative. In addition, we show that the new feature we proposed, the c-net RNNLM feature, plays a remarkable beneficial role in improving error detection rate. These results suggest that applications on top of image text recognition can benefit substantially from a hybrid strategy that combines techniques from optical character recognition and natural language processing.
  • Keywords
    document image processing; dynamic programming; natural language processing; optical character recognition; recurrent neural nets; text analysis; Chinese OCR error detection; Chinese image documents; RNNLM; c-net partial unfolding; conditional random field model; confusion network; dynamic programming; image text recognition; lexical local context; natural language processing; optical character recognition; recurrent neural network language modeling; Character recognition; Feature extraction; Hidden Markov models; Lattices; Optical character recognition software; Pragmatics; Speech recognition; confusion networks; error detection; optical character recognition; recurrent neural network language model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.227
  • Filename
    6976937