On the implementation of the harmonic plus noise model for concatenative speech synthesis

Author

Stylianou, Yannis

Author_Institution

SIPS, AT&T Bell Labs., Florham Park, NJ, USA

Volume

2

fYear

2000

fDate

2000

Abstract

In concatenative speech synthesis systems, speech models are usually used to represent the speech signal. Recently, the harmonic plus noise model (HNM) has been proposed for concatenative speech synthesis with promising results. One main drawback of HNM is its complexity. In this paper, we review four different methods of reducing the complexity of HNM. These include, straight-forward synthesis(SF), synthesis using inverse fast Fourier transform (IFFT), synthesis using recurrence relations for trigonometric functions (RR), and synthesis based on delayed multi-resampled cosine functions (DMRC). DMRC was shown to outperform all the other techniques reducing the complexity of HNM synthesizer by 95% compared to the current version of the HNM which is based on the SF method. Informal listening tests showed that the version of HNM based on the DMRC method provides higher quality of speech synthesis than the version based on SF

Keywords

computational complexity; fast Fourier transforms; harmonic analysis; speech synthesis; complexity reduction; concatenative speech synthesis; delayed multi-resampled cosine functions; harmonic plus noise model; informal listening tests; inverse fast Fourier transform; recurrence relations; speech models; straight-forward synthesis; trigonometric functions; Databases; Frequency; Low-frequency noise; Noise generators; Phase noise; Power harmonic filters; Signal generators; Signal synthesis; Speech enhancement; Speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on

Conference_Location

Istanbul

ISSN

1520-6149

Print_ISBN

0-7803-6293-4

Type

conf

DOI

10.1109/ICASSP.2000.859120

Filename

859120