DocumentCode
417224
Title
Closed-form estimation of the amplitude commands in the automatic extraction of the Fujisaki´s model
Author
Silva, S.D.S. ; Netto, Sergio L.
Author_Institution
COPPE, Univ. Fed. do Rio de Janeiro, Brazil
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Generation of F0 contours is required for natural-sounding text-to-speech systems. This task can be accomplished using the Fujisaki model, proven to be very good to describe F0 contours based on simple linguistically motivated parameters. However, the extraction of the Fujisaki model parameters is a very intricate problem. Several methods were proposed to solve this problem using iterative optimization techniques. This paper presents a new method capable of extracting the amplitude parameters of the Fujisaki model analytically. The time-marking commands are still obtained via iterative optimization. The result is a more accurate and less computationally intensive amplitude determination due to the proposed closed-form solution. Examples are included illustrating the application of the proposed method.
Keywords
amplitude estimation; feature extraction; frequency estimation; iterative methods; optimisation; speech intelligibility; speech synthesis; F0 contour generation; Fujisaki model; amplitude commands; automatic extraction; closed-form estimation; iterative optimization; natural-sounding text-to-speech systems; pitch contours; simple linguistically motivated parameters; time-marking commands; Amplitude estimation; Closed-form solution; Iterative algorithms; Iterative methods; Mathematical model; Optimization methods; Physiology; Quantum computing; Speech synthesis; Yield estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326062
Filename
1326062
Link To Document