• DocumentCode
    2855620
  • Title

    Offline recognition of handwritten Chinese characters using Gabor features, CDHMM modeling and MCE training

  • Author

    Ge, Yong ; Huo, Qiang ; Feng, Zhi-Dan

  • Author_Institution
    Dept. of Electronic Engineering & Information Science, University of Science and Technology of China, Hefei, Anhui, China
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    We´ve been developing a Chinese OCR engine for handwritten Chinese scripts. Currently, our OCR engine supports a vocabulary of 4616 characters which include 4516 simplified Chinese characters in GB2312-80, 62 alphanumeric characters, 38 punctuation marks and symbols. By using 1,384,800 character samples to train our recognizer, an averaged character recognition accuracy of 96.34% is achieved on a testing set of 1,025,535 character samples. An arguably best Chinese OCR product on the market achieves an accuracy of 94.07% for the recognizable Chinese characters in the above testing set. In this paper, we describe key techniques used in our recognizer that contribute to the high recognition accuracy, namely the use of Gabor features and their spatial derivatives as raw features, the use of LDA for feature extraction and dimension reduction, the use of CDHMMs for modeling Chinese characters along both horizontal and vertical directions, and the use of minimum classification error as a criterion for model training.
  • Keywords
    Feature extraction; Hidden Markov models; Irrigation; Nickel; Training; Variable speed drives;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743976
  • Filename
    5743976