DocumentCode :
3431360
Title :
Intonational phrase break prediction for text-to-speech synthesis using dependency relations
Author :
Mishra, Taniya ; Yeon-jun Kim ; Bangalore, Srinivas
Author_Institution :
Interactions, Franklin, MA, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4919
Lastpage :
4923
Abstract :
Intonational phrase (IP) break prediction is an important aspect of front-end analysis in a text-to-speech system. Standard approaches for intonational phrase break prediction rely on the use of linguistic rules or more recently, lexicalized data-driven models. Linguistic rules are not robust while data-driven models based on lexical identity do not generalize across domains. To overcome these challenges, in this paper, we explore the use of syntactic features to predict intonational phrase breaks. On a test set of over 40 thousand words, while a lexically driven IP break prediction model yields an F-score of 0.82, a non-lexicalized model that uses part-of-speech tags and dependency relations achieves an F-score of 0.81 with added feature of being more portable across domains. In this work, we also examine the effect of contextual information on prediction performance. Our evaluation shows that using a three-token left context in a POS-tag based model results in only a 2% drop in recall compared to a model that uses both a left and right context, which suggests the viability of using such a model for incremental text-to-speech system.
Keywords :
speech synthesis; IP break prediction; POS-tag based model; dependency relations; front-end analysis; incremental text-to-speech synthesis system; intonational phrase break prediction; lexicalized data-driven models; nonlexicalized model; part-of-speech tags; syntactic features; three-token left context; Computational modeling; Context; Context modeling; IP networks; Predictive models; Speech; Syntactics; IP prediction; Intonational phrase; phrase breaks; prosody; text-analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178906
Filename :
7178906
Link To Document :
بازگشت