• DocumentCode
    1637694
  • Title

    Development and Evaluation of Text Localization Techniques Based on Structural Texture Features and Neural Classifiers

  • Author

    Emmanouilidis, Christos ; Batsalas, Costas ; Papamarkos, Nikos

  • Author_Institution
    Comput. Syst. Unit, CETI/R.C.Athena, Xanthi, Greece
  • fYear
    2009
  • Firstpage
    1270
  • Lastpage
    1274
  • Abstract
    This paper presents a text localization approach for binarized printed document images. Emphasis is given to the feature extraction and feature selection stages. In the former, several document structure elements and spatial features, likely to convey useful information, are extracted. In the latter, evolutionary multi-objective feature selection is employed to identify combinations of features with simultaneous good performance in terms of text localization sensitivity and specificity. The selected features are applied to a range of classifiers. Performance results over document image sets from known databases are presented, employing the classifiers with or without feature selection. The results suggest that the hybrid techniques, which utilize the classifiers in combination with the customized pre-processing, feature extraction and feature selection stages, exhibit promising performance on a range of document images.
  • Keywords
    document image processing; evolutionary computation; feature extraction; image classification; image segmentation; image texture; text analysis; binarized printed document image; document image processing; evolutionary multiobjective feature selection; feature extraction; neural classifier; structural texture feature; text localization approach; Data mining; Feature extraction; Image analysis; Image recognition; Image segmentation; Image texture analysis; Sensitivity and specificity; Support vector machine classification; Support vector machines; Text analysis; Page layout analysis; evolutionary multiobjective algorithms; feature extraction; feature selection; neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.254
  • Filename
    5277681