DocumentCode
388551
Title
Time alignment of natural speech to synthetic speech
Author
Hunt, Melvyn J.
Author_Institution
National Research Council of Canada, Ont., Canada
Volume
9
fYear
1984
fDate
30742
Firstpage
65
Lastpage
68
Abstract
A capacity to carry out reliable automatic time alignment of synthetic speech to naturally produced speech offers potential benfits in speech recognition and speaker recognition as well as in synthesis itself. Phrase alignment experiments are described that indicate that alignment to synthetic speech is more difficult than alignment of speech from two natural speakers. An artificial speech recognition experiment is introduced as a convenient means of assessing alignment accuracy. By this measure, alignment accuracy is found to be improved considerably by applying certain speaker adaptation transformations to the synthetic speech, by modifying the spectrum similarity metric, and by generating the synthetic spectra directly from the control parameters using simplified excitation spectra. The improvements seem to limit, however, at a level below that found between natural speakers. It is conjectured that further improvement requires modifications to the synthesis rules themselves.
Keywords
Councils; Frequency; Humans; Labeling; Natural languages; Speaker recognition; Speech analysis; Speech processing; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
Type
conf
DOI
10.1109/ICASSP.1984.1172424
Filename
1172424
Link To Document