Title :
Syllable segmentation of Telugu document images
Author :
Babu, Tummala Ranga ; Rao, Venkata N. ; Reddy, Pratap L. ; Prabhu, Krishna T S ; Babu, Raveendra B.
Author_Institution :
Dept. of ECE, RVR & JC Coll. of Eng., Guntur, India
Abstract :
This paper describes a syllable segmentation method for printed text documents in Telugu, a South Indian language. Syllable segmentation is a fundamental step for various applications, namely word segmentation, speech synthesis, and information retrieval. The segmentation algorithm is motivated by the structure of the script. The segmentation process is complicated by the inherent ambiguities of natural language. This paper presents a non-dictionary approach to syllable segmentation for Telugu that uses syllable width, and compares it with the existing aspect ratio approach. The experimental results show a segmentation accuracy of more than 99%.
Keywords :
document image processing; image segmentation; South Indian language; Telugu document images; printed text documents; syllable segmentation; Character recognition; Data mining; Image recognition; Image segmentation; Optical character recognition software; Shape; Signal processing; Aspect Ratio; Syllable Segmentation; Syllable Width
Conference_Title :
Signal Processing Systems (ICSPS), 2010 2nd International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-6892-8
Electronic_ISBN :
978-1-4244-6893-5
DOI :
10.1109/ICSPS.2010.5555217