DocumentCode
3343673
Title
Joint prosody prediction and unit selection for concatenative speech synthesis
Author
Bulyko, Ivan ; Ostendorf, Muri
Author_Institution
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
Volume
2
fYear
2001
fDate
2001
Firstpage
781
Abstract
We describe how prosody prediction can be efficiently integrated with the unit selection process in a concatenative speech synthesizer under a weighted finite-state transducer (WFST) architecture. WFSTs representing prosody prediction and unit selection can be composed during synthesis, thus effectively expanding the space of possible prosodic targets. We implemented a symbolic prosody prediction module and a unit selection database as the synthesis components of a travel planning system. Results of perceptual experiments show that by combining the steps of prosody prediction and unit selection we are able to achieve improved naturalness of synthetic speech compared to the sequential implementation
Keywords
speech intelligibility; speech synthesis; travel industry; concatenative speech synthesis; perceptual experiments; symbolic prosody prediction module; synthetic speech naturalness; travel planning system; unit selection database; weighted finite-state transducer; Computer interfaces; Cost function; Databases; Diversity reception; Space exploration; Speech processing; Speech synthesis; Synthesizers; Telephony; Transducers;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.941031
Filename
941031
Link To Document