DocumentCode :
2226828
Title :
Prosody modeling for an embedded TTS system implementation
Author :
Burileanu, Dragos ; Negrescu, Cristian
Author_Institution :
Fac. of Electron., Telecommun. & IT, “Politeh.” Univ. of Bucharest, Bucharest, Romania
fYear :
2006
fDate :
4-8 Sept. 2006
Firstpage :
1
Lastpage :
5
Abstract :
Prosody quality strongly influences the intelligibility and perceived naturalness of synthetic speech. Despite significant progress in prosody modeling in recent years, the incomplete linguistic knowledge that can be derived from text and various language-specific issues still limit the quality of today's commercial text-to-speech (TTS) systems. Moreover, obtaining correct pronunciation and intonation in embedded speech applications with severe resource constraints is an even more challenging task. The paper describes an enhanced version of an embedded TTS system for the Romanian language, and proposes and discusses an efficient rule-based intonation model for prosody generation. Informal listening tests show that highly intelligible and fairly natural synthetic speech can be produced with a small memory footprint and low computational resources.
Keywords :
natural language processing; speech intelligibility; speech synthesis; Romanian language; embedded TTS system; embedded speech; incomplete linguistic knowledge; informal listening test; language-specific issue; natural synthetic speech; prosody generation; prosody modeling; rule-based intonation model; speech intelligibility; text-to-speech system; Analytical models; Databases; Europe; Pragmatics; Signal processing; Speech; Stress;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing Conference, 2006 14th European
Conference_Location :
Florence
ISSN :
2219-5491
Type :
conf
Filename :
7071708