مرکز منطقه ای اطلاع رساني علوم و فناوري - Prosody modeling for an embedded TTS system implementation

DocumentCode :

2226828

Title :

Prosody modeling for an embedded TTS system implementation

Author :

Burileanu, Dragos ; Negrescu, Cristian

Author_Institution :

Fac. of Electron., Telecommun. & IT, “Politeh.” Univ. of Bucharest, Bucharest, Romania

fYear :

2006

fDate :

4-8 Sept. 2006

Firstpage :

Lastpage :

Abstract :

Prosody quality strongly influences the intelligibility and the perceived naturalness of synthetic speech. But despite the significant progress in prosody modeling from the last years, incomplete linguistic knowledge that can be derived from text and various language-specific issues still limit the quality of today´s commercial text-to-speech (TTS) systems. Moreover, obtaining a right pronunciation and intonation for embedded speech applications that have severe resource constraints, is a more challenging task. The paper describes an enhanced version of an embedded TTS system in Romanian language and proposes and discusses an efficient rule-based intonation model for prosody generation. Informal listening tests show that highly intelligible and fair natural synthetic speech can be produced with small memory footprint and low computational resources.

Keywords :

natural language processing; speech intelligibility; speech synthesis; Romanian language; embedded TTS system; embedded speech; incomplete linguistic knowledge; informal listening test; language-specific issue; natural synthetic speech; prosody generation; prosody modeling; rule-based intonation model; speech intelligibility; text-to-speech system; Analytical models; Databases; Europe; Pragmatics; Signal processing; Speech; Stress;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing Conference, 2006 14th European

Conference_Location :

Florence

ISSN :

2219-5491

Type :

conf

Filename :

7071708

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2226828