DocumentCode
573180
Title
A new framework based on signature patches, micro registration, and sparse representation for optical text recognition
Author
Moghaddam, Reza Farrahi ; Moghaddam, Fereydoun Farrahi ; Cheriet, Mohamed
Author_Institution
Ecole de Technol. Super., Synchromedia Lab. for Multimedia Commun. in Telepresence, Montreal, QC, Canada
fYear
2012
fDate
2-5 July 2012
Firstpage
1259
Lastpage
1265
Abstract
A framework for the development of segmentation-free optical recognizers of ancient manuscripts, which work without line, word, or character segmentation, is proposed. The framework introduces a new representation of visual text using the concept of signature patches. These patches, which are free from the traditional guidelines of text, such as the baseline, are registered to each other using a microscale registration method based on the estimation of the active regions using a multilevel classifier, the directional map. Then, a one-dimensional feature vector, named spiral features, is extracted from the registered signature patches. The incremental learning process is performed using a sparse representation over a dictionary of spiral feature atoms. The framework is applied to the George Washington database with promising results.
Keywords
document image processing; feature extraction; image registration; image representation; image segmentation; learning (artificial intelligence); optical character recognition; text analysis; text detection; George Washington database; active regions estimation; ancient manuscripts; character segmentation; dictionary; directional map; feature extraction; incremental learning process; line segmentation; microregistration; microscale registration method; multilevel classifier; one-dimensional feature vector; optical text recognition; segmentation-free optical recognizers; signature patches; sparse representation; spiral feature atoms; spiral features; text guidelines; visual text; word segmentation; Complexity theory; Data models; Dictionaries; Image segmentation; Spirals; Standards; Vectors;
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location
Montreal, QC
Print_ISBN
978-1-4673-0381-1
Electronic_ISBN
978-1-4673-0380-4
Type
conf
DOI
10.1109/ISSPA.2012.6310485
Filename
6310485