مرکز منطقه ای اطلاع رساني علوم و فناوري - Strokelets: A Learned Multi-scale Representation for Scene Text Recognition

DocumentCode :

254465

Title :

Strokelets: A Learned Multi-scale Representation for Scene Text Recognition

Author :

Cong Yao ; Xiang Bai ; Baoguang Shi ; Wenyu Liu

fYear :

2014

fDate :

23-28 June 2014

Firstpage :

4042

Lastpage :

4049

Abstract :

Driven by the wide range of applications, scene text detection and recognition have become active research topics in computer vision. Though extensively studied, localizing and reading text in uncontrolled environments remain extremely challenging, due to various interference factors. In this paper, we propose a novel multi-scale representation for scene text recognition. This representation consists of a set of detectable primitives, termed as strokelets, which capture the essential substructures of characters at different granularities. Strokelets possess four distinctive advantages: (1) Usability: automatically learned from bounding box labels, (2) Robustness: insensitive to interference factors, (3) Generality: applicable to variant languages, and (4) Expressivity: effective at describing characters. Extensive experiments on standard benchmarks verify the advantages of strokelets and demonstrate the effectiveness of the proposed algorithm for text recognition.

Keywords :

character recognition; computer vision; image recognition; image representation; text detection; character recognition; computer vision; interference factors; multiscale representation; scene text detection; scene text recognition; strokelets; Character recognition; Clustering algorithms; Noise; Prototypes; Robustness; Text recognition; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

Conference_Location :

Columbus, OH

Type :

conf

DOI :

10.1109/CVPR.2014.515

Filename :

6909911

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=254465