مرکز منطقه ای اطلاع رساني علوم و فناوري - RIch-context Unit Selection (RUS) approach to high quality TTS

DocumentCode :

2792046

Title :

RIch-context Unit Selection (RUS) approach to high quality TTS

Author :

Yan, Zhi-Jie ; Qian, Yao ; Soong, Frank K.

Author_Institution :

Microsoft Res. Asia, Beijing, China

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4798

Lastpage :

4801

Abstract :

This paper presents a Rich-context Unit Selection (RUS) approach to high quality speech synthesis. Based upon our previous work on rich context modeling, we use the corresponding parametric HMMs to represent waveform units and form a “sausage-like” lattice. A prune-and-search procedure is proposed, in which Kullback-Leibler divergence is adopted to select potential candidate units, and normalized cross-correlation is used as the final objective measure to search for the optimal unit path. The maximum cross-correlation criterion provides the optimal concatenation between successive units, in terms of spectral similarity, phase continuity and best connecting timing instants. Subjectively, both preference and MOS tests were conducted to compare RUS with our current Weight-table based Unit Selection (WUS) synthesis. Experimental results show that the voice quality of synthesized speech is significantly improved by RUS over the conventional WUS.

Keywords :

correlation methods; hidden Markov models; speech synthesis; HMM; Kullback-Leibler divergence; MOS test; high quality TTS; high quality speech synthesis; normalized cross correlation; optimal concatenation; potential candidate unit; rich context modeling; rich-context unit selection; voice quality; weight-table based unit selection synthesis; Asia; Context modeling; Hidden Markov models; Joining processes; Labeling; Lattices; Prototypes; Speech synthesis; Testing; Timing; Hidden Markov models; Rich-context unit selection (RUS); Speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495150

Filename :

5495150

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2792046