مرکز منطقه ای اطلاع رساني علوم و فناوري - Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model

DocumentCode :

3432302

Title :

Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model

Author :

Ogbureke, Udochukwu ; Cabral, Joao ; Berndsen, Julie

Author_Institution :

Sch. of Comput. Sci. & Inf., Univ. Coll. Dublin, Dublin, Ireland

fYear :

2012

fDate :

2-5 July 2012

Firstpage :

700

Lastpage :

705

Abstract :

This paper presents a novel approach to explicit duration modelling for HMM-based speech synthesis. The proposed approach is a two-step process. The first step in this process is state level phone alignment and conversion of phone durations into the number of frames. In the second step, a hidden Markov model (HMM) is trained whereby the observation is the number of frames in each state and the hidden state the phone. Finally, the duration of each state (the number of frames) is generated from the trained HMM. Hidden semi-Markov model (HSMM) is the baseline for explicit duration modelling in HMM-based speech synthesis. Both objective and perceptual evaluation on a held-out test set showed comparable results with a baseline HSMM-based speech synthesis. This duration modelling approach is computationally simpler than HSMM and produces comparable results in terms of the quality of synthetic speech.

Keywords :

hidden Markov models; speech synthesis; HMM-based speech synthesis; continuous hidden Markov model; explicit duration; hidden semiMarkov model; perceptual evaluation; phone duration conversion; state level phone alignment; trained HMM; Computational modeling; Context; Context modeling; Hidden Markov models; Speech; Speech synthesis; Training; Duration modelling; HMM-based TTS; hidden Markov model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on

Conference_Location :

Montreal, QC

Print_ISBN :

978-1-4673-0381-1

Electronic_ISBN :

978-1-4673-0380-4

Type :

conf

DOI :

10.1109/ISSPA.2012.6310643

Filename :

6310643

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3432302