Closed-form estimation of the amplitude commands in the automatic extraction of the Fujisaki´s model

Author

Silva, S.D.S. ; Netto, Sergio L.

Author_Institution

COPPE, Univ. Fed. do Rio de Janeiro, Brazil

Volume

1

fYear

2004

fDate

17-21 May 2004

Abstract

Generation of F0 contours is required for natural-sounding text-to-speech systems. This task can be accomplished using the Fujisaki model, proven to be very good to describe F0 contours based on simple linguistically motivated parameters. However, the extraction of the Fujisaki model parameters is a very intricate problem. Several methods were proposed to solve this problem using iterative optimization techniques. This paper presents a new method capable of extracting the amplitude parameters of the Fujisaki model analytically. The time-marking commands are still obtained via iterative optimization. The result is a more accurate and less computationally intensive amplitude determination due to the proposed closed-form solution. Examples are included illustrating the application of the proposed method.

Keywords

amplitude estimation; feature extraction; frequency estimation; iterative methods; optimisation; speech intelligibility; speech synthesis; F0 contour generation; Fujisaki model; amplitude commands; automatic extraction; closed-form estimation; iterative optimization; natural-sounding text-to-speech systems; pitch contours; simple linguistically motivated parameters; time-marking commands; Amplitude estimation; Closed-form solution; Iterative algorithms; Iterative methods; Mathematical model; Optimization methods; Physiology; Quantum computing; Speech synthesis; Yield estimation;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-8484-9

Type

conf

DOI

10.1109/ICASSP.2004.1326062

Filename

1326062