DocumentCode
2281938
Title
On the implementation of the harmonic plus noise model for concatenative speech synthesis
Author
Stylianou, Yannis
Author_Institution
SIPS, AT&T Bell Labs., Florham Park, NJ, USA
Volume
2
fYear
2000
fDate
2000
Abstract
In concatenative speech synthesis systems, speech models are usually used to represent the speech signal. Recently, the harmonic plus noise model (HNM) has been proposed for concatenative speech synthesis with promising results. One main drawback of HNM is its complexity. In this paper, we review four different methods of reducing the complexity of HNM. These include, straight-forward synthesis(SF), synthesis using inverse fast Fourier transform (IFFT), synthesis using recurrence relations for trigonometric functions (RR), and synthesis based on delayed multi-resampled cosine functions (DMRC). DMRC was shown to outperform all the other techniques reducing the complexity of HNM synthesizer by 95% compared to the current version of the HNM which is based on the SF method. Informal listening tests showed that the version of HNM based on the DMRC method provides higher quality of speech synthesis than the version based on SF
Keywords
computational complexity; fast Fourier transforms; harmonic analysis; speech synthesis; complexity reduction; concatenative speech synthesis; delayed multi-resampled cosine functions; harmonic plus noise model; informal listening tests; inverse fast Fourier transform; recurrence relations; speech models; straight-forward synthesis; trigonometric functions; Databases; Frequency; Low-frequency noise; Noise generators; Phase noise; Power harmonic filters; Signal generators; Signal synthesis; Speech enhancement; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.859120
Filename
859120
Link To Document