Speech synthesis system based on a variable decimation/interpolation factor

Author

De los Galanes, F. M Giménez ; Savoji, M.H. ; Pardo, J.M.

Author_Institution

ETSI Telecomunicacion, Univ. Politecnica de Madrid, Spain

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

636

Abstract

In this paper we present a modification of the usual decimation-interpolation steps for resampling of speech signals which is especially adapted to arbitrary modification of fundamental frequency and duration of speech segments. The modification is intended to overcome the time and frequency domain limitation that such a resampling scheme imposes so it can be used in a speech synthesis system. The performance of this resampling method for prosody modification is better than the equivalent PSOLA (Pitch-Synchronous Overlap-Add) method when working at a sampling frequency of 8 to 10 kilohertz so the source spectrum of the voiced allophones can be said to be completely harmonical. An optimization of the proposed algorithm that allows a real time implementation is also discussed

Keywords

interpolation; optimisation; signal sampling; speech synthesis; algorithm; frequency domain limitation; fundamental frequency; interpolation factor; optimization; performance; prosody modification; real time implementation; resampling; source spectrum; speech segments duration; speech signals; speech synthesis system; time domain limitation; variable decimation; voiced allophones; Databases; Frequency domain analysis; Interpolation; Linear predictive coding; Phase change materials; Sampling methods; Signal analysis; Speech analysis; Speech synthesis; Telecommunications;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479678

Filename

479678