DocumentCode
2931391
Title
Speech synthesis system based on a variable decimation/interpolation factor
Author
De los Galanes, F.M. Giménez; Savoji, M.H.; Pardo, J.M.
Author_Institution
ETSI Telecomunicacion, Univ. Politecnica de Madrid, Spain
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
636
Abstract
In this paper we present a modification of the usual decimation-interpolation steps for resampling speech signals that is specially adapted to arbitrary modification of the fundamental frequency and duration of speech segments. The modification is intended to overcome the time- and frequency-domain limitations that such a resampling scheme imposes, so that it can be used in a speech synthesis system. For prosody modification, this resampling method performs better than the equivalent PSOLA (Pitch-Synchronous Overlap-Add) method when working at a sampling frequency of 8 to 10 kHz, so the source spectrum of the voiced allophones can be said to be completely harmonic. An optimization of the proposed algorithm that allows a real-time implementation is also discussed.
Keywords
interpolation; optimisation; signal sampling; speech synthesis; algorithm; frequency domain limitation; fundamental frequency; interpolation factor; optimization; performance; prosody modification; real time implementation; resampling; source spectrum; speech segments duration; speech signals; speech synthesis system; time domain limitation; variable decimation; voiced allophones; Databases; Frequency domain analysis; Interpolation; Linear predictive coding; Phase change materials; Sampling methods; Signal analysis; Speech analysis; Speech synthesis; Telecommunications
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479678
Filename
479678