DocumentCode
2407185
Title
Creation of acoustic signal dictionary for ESNOLA based concatenated Bangla and Nepali TTS system
Author
Khan, Soma ; Roy, Rajib
Author_Institution
Centre for Dev. of Adv. Comput., Kolkata, India
fYear
2011
fDate
26-28 Oct. 2011
Firstpage
162
Lastpage
167
Abstract
This paper describes the detailed design and development of two acoustic signal dictionaries, one each for the Bangla (SCB) and Nepali concatenative TTS systems based on the Epoch Synchronous Non OverLap Add (ESNOLA) method. The work uses a new set of sub-phonemic signal units, called partnemes, and allows a flexible approach to the length of transitions. Partnemes include identifiable portions unique to phonemes, their transitions, and co-articulations. The creation process includes a series of normalizations (pitch, amplitude, and DC) with judicious selection and augmentation of speech segments, so that smaller, fundamental, yet appropriate parts of the phonemes and of the inter-phoneme and inter-word transitions can be used as acoustic units. Encouraging listening-test results confirm the good perceptual quality and acceptability of the developed signal dictionaries. Together, the ESNOLA framework and optimally sized partneme inventories give a simple approach to generating high-quality synthesized speech that is easily portable to hand-held devices.
Keywords
natural language processing; speech synthesis; Bangla-Nepali TTS system; ESNOLA; acoustic signal dictionary creation; epoch synchronous nonoverlap add method; hand-held devices; interphoneme; interword transitions; partnemes; speech segments; subphonemic level; Buildings; Dictionaries; Materials; Speech; Speech processing; Synthesizers; ESNOLA; Partnemes; TTS; Transition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
Conference_Location
Hsinchu
Print_ISBN
978-1-4577-0930-2
Type
conf
DOI
10.1109/ICSDA.2011.6086000
Filename
6086000