• DocumentCode
    423805
  • Title
    Optical formula recognition based on structural features
  • Author
    Tian, Xuedong
  • Author_Institution
    Fac. of Math. & Comput. Sci., Hebei Univ., Baoding, China
  • Volume
    6
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    3741
  • Abstract
    Automatic formula recognition is a key part of an OCR system. It makes it possible to reuse knowledge in scientific books that are not available in electronic form. A method of optical formula recognition is described. It consists of two major steps: symbol recognition and structural analysis. First, connected components are searched for and processed to obtain the symbol components, which are then recognized. Next, the structure of the formula is analyzed on the basis of the recognition results and geometric features. The system works reliably on nearly noise-free images scanned from ordinary, clearly printed documents.
  • Keywords
    document image processing; optical character recognition; optical formula recognition; structural analysis; structural feature; symbol recognition; Character recognition; Computer science; Geometrical optics; Image analysis; Image recognition; Mathematics; Optical character recognition software; Optical devices; Optical noise; Text recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type
    conf
  • DOI
    10.1109/ICMLC.2004.1380470
  • Filename
    1380470