Can RNNs reliably separate script and language at word and line level?

Author

Ajeet Kumar Singh;C. V. Jawahar

Author_Institution

Center for Visual Information Technology, IIIT Hyderabad, India

fYear

2015

Firstpage

976

Lastpage

980

Abstract

In this work, we investigate the utility of Recurrent Neural Networks (RNNs) for script and language identification. Both these problems have been attempted in the past with representations computed from the distribution of connected components or characters (e.g. texture, n-gram). Often these features are computed from a larger segment (a paragraph or a page). We argue that one can predict the script or language with minimal evidence (e.g. given only a word or a line) very accurately with the help of a pre-trained RNN. We propose a simple and generic solution for the task of script and language identification which do not require any special tuning. Our method represents the word images as a sequence of feature vectors, and employ the RNNs for the identification. We verify the method on a large corpus of more than 15.03M words from 55K document images comprising 15 scripts and languages. We report an accurate script and language identification at word and line level.

Keywords

"Reliability","Logic gates","Noise measurement"

Publisher

ieee

Conference_Titel

Document Analysis and Recognition (ICDAR), 2015 13th International Conference on

Type

conf

DOI

10.1109/ICDAR.2015.7333907

Filename

7333907

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3695237