مرکز منطقه ای اطلاع رساني علوم و فناوري - Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems

DocumentCode :

2064673

Title :

Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems

Author :

Oura, Keiichiro ; Nankaku, Yoshihiko ; Toda, Tomoki ; Tokuda, Keiichi ; Maia, Rannierry ; Sakai, Shinsuke ; Nakamura, Satoshi

Author_Institution :

Dept. of Comput. Sci. & Enginnering, Nagoya Inst. of Technol., Nagoya, Japan

fYear :

2008

fDate :

16-19 Dec. 2008

Firstpage :

Lastpage :

Abstract :

A new integrated model for simultaneous modeling of linguistic and acoustic models, and a training algorithm is proposed. Usually, text-to-speech (TTS) systems based on the hidden Markov model (HMM) consist of text analysis and speech synthesis modules. Linguistic and acoustic model training are performed independently using different training data sets. Integrated model parameters were simultaneously optimized by the proposed training algorithm. The derived algorithm optimizes two model parameter sets simultaneously. Therefore, the appropriate model is estimated because we can directly-formulate the TTS problem in which the speech waveform is generated from a word sequence. We conducted objective evaluation experiments using phrasing and prosodic models to evaluate the effectiveness of the proposed technique.

Keywords :

hidden Markov models; speech synthesis; acoustic model training; data set training; hidden Markov model; integrated model parameter; linguistic model training; phrasing model training; prosodic model training; speech synthesis modules; text-to-speech systems; word sequence; Computer science; Databases; Decision trees; Hidden Markov models; Natural languages; Speech synthesis; Tagging; Text analysis; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on

Conference_Location :

Kunming

Print_ISBN :

978-1-4244-2942-4

Electronic_ISBN :

978-1-4244-2943-1

Type :

conf

DOI :

10.1109/CHINSL.2008.ECP.12

Filename :

4730266

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2064673