DocumentCode :
2186719
Title :
Challenges in baseline detection of cursive script languages
Author :
Naz, Sabiha ; Hayat, K. ; Anwar, Muhammad Waqas ; Akbar, Habib ; Razzak, Muhammad Imran
Author_Institution :
COMSATS Inst. of Inf. Technol., Abbottabad, Pakistan
fYear :
2013
fDate :
7-9 Oct. 2013
Firstpage :
551
Lastpage :
556
Abstract :
Optical Character Recognition (OCR) is an important task with the rapid growth of the digital computers, online information services, PDAs and for conversion of text documents into digital text. This task enhances preservation of records and makes the access to documents easier. So, Baseline detection is an important step in the OCR because it directly affects the rest of the steps and increase the performance and efficiency of character segmentation and feature extraction in OCR process. In this paper, we provide a comprehensive review of baseline detection methods for Urdu language. The aim of the paper is to introduce the challenges during baseline detection in cursive script languages for Nasta´liq and Naskh font.
Keywords :
natural language processing; optical character recognition; text analysis; Naskh font; Nasta´liq font; OCR; Urdu language; baseline detection; cursive script languages; optical character recognition; Character recognition; Estimation; Feature extraction; Handwriting recognition; Optical character recognition software; Shape; Writing; Baseline; Naskh; Nasta´liq; two descender lines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Science and Information Conference (SAI), 2013
Conference_Location :
London
Type :
conf
Filename :
6661792
Link To Document :
بازگشت