مرکز منطقه ای اطلاع رساني علوم و فناوري - Challenges in baseline detection of cursive script languages

DocumentCode :

2186719

Title :

Challenges in baseline detection of cursive script languages

Author :

Naz, Sabiha ; Hayat, K. ; Anwar, Muhammad Waqas ; Akbar, Habib ; Razzak, Muhammad Imran

Author_Institution :

COMSATS Inst. of Inf. Technol., Abbottabad, Pakistan

fYear :

2013

fDate :

7-9 Oct. 2013

Firstpage :

551

Lastpage :

556

Abstract :

Optical Character Recognition (OCR) is an important task with the rapid growth of the digital computers, online information services, PDAs and for conversion of text documents into digital text. This task enhances preservation of records and makes the access to documents easier. So, Baseline detection is an important step in the OCR because it directly affects the rest of the steps and increase the performance and efficiency of character segmentation and feature extraction in OCR process. In this paper, we provide a comprehensive review of baseline detection methods for Urdu language. The aim of the paper is to introduce the challenges during baseline detection in cursive script languages for Nasta´liq and Naskh font.

Keywords :

natural language processing; optical character recognition; text analysis; Naskh font; Nasta´liq font; OCR; Urdu language; baseline detection; cursive script languages; optical character recognition; Character recognition; Estimation; Feature extraction; Handwriting recognition; Optical character recognition software; Shape; Writing; Baseline; Naskh; Nasta´liq; two descender lines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Science and Information Conference (SAI), 2013

Conference_Location :

London

Type :

conf

Filename :

6661792

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2186719