Title :
Grapheme to Phoneme (G2P) conversion for Bangla
Author :
Basu, Joyanta ; Basu, Tulika ; Mitra, Mridusmita ; Mandal, Srimanta
Author_Institution :
Centre for Dev. of Adv. Comput. (C-DAC), Kolkata, India
Abstract :
The automatic conversion of text to phoneme is a necessary step in all-current approaches to text-to-speech (TTS) synthesis and automatic speech recognition system. This paper presents a methodology for grapheme to phoneme (G2P) conversion for Bangla based on orthographic rules. In Bangla G2P conversion sometimes depends not only on orthographic information but also on parts of speech (POS) information and semantics. This paper also addresses these issues along with their implementation methodology. The G2P conversion system of Bangla is tested on 1000 different types of Bangla sentences containing 9294 words. The percentage of correct conversion is 91.58% without considering the semantics and contextual POS with the exception table size of 333 words. If those errors which occur due to lack of exceptional words are considered, then the percentage of correct conversion will increase to 98%.
Keywords :
natural language processing; speech recognition; speech synthesis; Bangla sentences; automatic speech recognition system; grapheme-to-phoneme conversion; orthographic information; orthographic rules; parts of speech information; semantics; text-to-phoneme conversion; text-to-speech synthesis; Automatic speech recognition; Dictionaries; Error correction; History; Modems; Morphology; Speech synthesis; System testing; Vocabulary; Writing;
Conference_Titel :
Speech Database and Assessments, 2009 Oriental COCOSDA International Conference on
Conference_Location :
Urumqi
Print_ISBN :
978-1-4244-4400-7
Electronic_ISBN :
978-1-4244-4400-7
DOI :
10.1109/ICSDA.2009.5278373