DocumentCode :
2486356
Title :
Segmental Duration Modeling for Greek Speech Synthesis
Author :
Lazaridis, Alexandros ; Zervas, Panagiotis ; Kokkinakis, George
Author_Institution :
Univ. of Patras, Patras
Volume :
2
fYear :
2007
fDate :
29-31 Oct. 2007
Firstpage :
518
Lastpage :
521
Abstract :
In this paper we cope with the task of modeling phoneme duration for Greek speech synthesis. In particular we apply well established machine learning approaches to the WCL-1 prosodic database for predicting segmental durations from shallow morphosyntactic and prosodic features. We employ decision trees, instance based learning and linear regression. Trained on a 5500 word database, both CART and linear regression models proved to be the most effective in terms for the task with a root mean square error off 0. 0252 and 0.0251 respectively.
Keywords :
decision trees; learning (artificial intelligence); regression analysis; speech synthesis; Greek speech synthesis; WCL-1 prosodic database; decision trees; instance based learning; linear regression; machine learning; morphosyntactic features; phoneme duration; prosodic features; root mean square error; segmental duration modeling; Artificial intelligence; Artificial neural networks; Bayesian methods; Decision trees; Linear regression; Machine learning; Root mean square; Spatial databases; Speech synthesis; Wire;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
Conference_Location :
Patras
ISSN :
1082-3409
Print_ISBN :
978-0-7695-3015-4
Type :
conf
DOI :
10.1109/ICTAI.2007.33
Filename :
4410432
Link To Document :
بازگشت