• DocumentCode
    573180
  • Title

    A new framework based on signature patches, micro registration, and sparse representation for optical text recognition

  • Author

    Moghaddam, Reza Farrahi ; Moghaddam, Fereydoun Farrahi ; Cheriet, Mohamed

  • Author_Institution
    Ecole de Technol. Super., Synchromedia Lab. for Multimedia Commun. in Telepresence, Montreal, QC, Canada
  • fYear
    2012
  • fDate
    2-5 July 2012
  • Firstpage
    1259
  • Lastpage
    1265
  • Abstract
    A framework is proposed for developing segmentation-free optical recognizers of ancient manuscripts, which operate without line, word, or character segmentation. The framework introduces a new representation of visual text based on the concept of signature patches. These patches, which are free from traditional text guidelines such as the baseline, are registered to each other using a microscale registration method based on estimating the active regions with a multilevel classifier, the directional map. Then, a one-dimensional feature vector, named spiral features, is extracted from the registered signature patches. Incremental learning is performed through a sparse representation over a dictionary of spiral feature atoms. The framework is applied to the George Washington database with promising results.
  • Keywords
    document image processing; feature extraction; image registration; image representation; image segmentation; learning (artificial intelligence); optical character recognition; text analysis; text detection; George Washington database; active regions estimation; ancient manuscripts; character segmentation; dictionary; directional map; feature extraction; incremental learning process; line segmentation; microregistration; microscale registration method; multilevel classifier; one-dimensional feature vector; optical text recognition; segmentation-free optical recognizers; signature patches; sparse representation; spiral feature atoms; spiral features; text guidelines; visual text; word segmentation; Complexity theory; Data models; Dictionaries; Image segmentation; Spirals; Standards; Vectors;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
  • Conference_Location
    Montreal, QC
  • Print_ISBN
    978-1-4673-0381-1
  • Electronic_ISBN
    978-1-4673-0380-4
  • Type

    conf

  • DOI
    10.1109/ISSPA.2012.6310485
  • Filename
    6310485