• DocumentCode
    388551
  • Title

    Time alignment of natural speech to synthetic speech

  • Author

    Hunt, Melvyn J.

  • Author_Institution
    National Research Council of Canada, Ont., Canada
  • Volume
    9
  • fYear
    1984
  • fDate
    30742
  • Firstpage
    65
  • Lastpage
    68
  • Abstract
    A capacity to carry out reliable automatic time alignment of synthetic speech to naturally produced speech offers potential benfits in speech recognition and speaker recognition as well as in synthesis itself. Phrase alignment experiments are described that indicate that alignment to synthetic speech is more difficult than alignment of speech from two natural speakers. An artificial speech recognition experiment is introduced as a convenient means of assessing alignment accuracy. By this measure, alignment accuracy is found to be improved considerably by applying certain speaker adaptation transformations to the synthetic speech, by modifying the spectrum similarity metric, and by generating the synthetic spectra directly from the control parameters using simplified excitation spectra. The improvements seem to limit, however, at a level below that found between natural speakers. It is conjectured that further improvement requires modifications to the synthesis rules themselves.
  • Keywords
    Councils; Frequency; Humans; Labeling; Natural languages; Speaker recognition; Speech analysis; Speech processing; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1984.1172424
  • Filename
    1172424