DocumentCode
1652256
Title
Grapheme to Phoneme (G2P) conversion for Bangla
Author
Basu, Joyanta ; Basu, Tulika ; Mitra, Mridusmita ; Mandal, Srimanta
Author_Institution
Centre for Development of Advanced Computing (C-DAC), Kolkata, India
fYear
2009
Firstpage
66
Lastpage
71
Abstract
The automatic conversion of text to phonemes is a necessary step in all current approaches to text-to-speech (TTS) synthesis and automatic speech recognition systems. This paper presents a methodology for grapheme-to-phoneme (G2P) conversion for Bangla based on orthographic rules. In Bangla, G2P conversion sometimes depends not only on orthographic information but also on part-of-speech (POS) information and semantics. This paper addresses these issues along with their implementation methodology. The Bangla G2P conversion system is tested on 1000 Bangla sentences of different types, containing 9294 words. With an exception table of 333 words, and without considering semantics or contextual POS, the percentage of correct conversion is 91.58%. If the errors caused by words missing from the exception table are accounted for, the percentage of correct conversion increases to 98%.
Keywords
natural language processing; speech recognition; speech synthesis; Bangla sentences; automatic speech recognition system; grapheme-to-phoneme conversion; orthographic information; orthographic rules; parts of speech information; semantics; text-to-phoneme conversion; text-to-speech synthesis; Automatic speech recognition; Dictionaries; Error correction; History; Modems; Morphology; Speech synthesis; System testing; Vocabulary; Writing
fLanguage
English
Publisher
IEEE
Conference_Titel
Speech Database and Assessments, 2009 Oriental COCOSDA International Conference on
Conference_Location
Urumqi
Print_ISBN
978-1-4244-4400-7
Electronic_ISBN
978-1-4244-4400-7
Type
conf
DOI
10.1109/ICSDA.2009.5278373
Filename
5278373