DocumentCode :
388551
Title :
Time alignment of natural speech to synthetic speech
Author :
Hunt, Melvyn J.
Author_Institution :
National Research Council of Canada, Ont., Canada
Volume :
9
fYear :
1984
fDate :
30742
Firstpage :
65
Lastpage :
68
Abstract :
A capacity to carry out reliable automatic time alignment of synthetic speech to naturally produced speech offers potential benfits in speech recognition and speaker recognition as well as in synthesis itself. Phrase alignment experiments are described that indicate that alignment to synthetic speech is more difficult than alignment of speech from two natural speakers. An artificial speech recognition experiment is introduced as a convenient means of assessing alignment accuracy. By this measure, alignment accuracy is found to be improved considerably by applying certain speaker adaptation transformations to the synthetic speech, by modifying the spectrum similarity metric, and by generating the synthetic spectra directly from the control parameters using simplified excitation spectra. The improvements seem to limit, however, at a level below that found between natural speakers. It is conjectured that further improvement requires modifications to the synthesis rules themselves.
Keywords :
Councils; Frequency; Humans; Labeling; Natural languages; Speaker recognition; Speech analysis; Speech processing; Speech recognition; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
Type :
conf
DOI :
10.1109/ICASSP.1984.1172424
Filename :
1172424
Link To Document :
بازگشت