• DocumentCode
    3246436
  • Title

    Using estimated formants tracks for formants smoothing in text to speech (TTS) synthesis

  • Author

    Low, Phuq Hui ; Ho, Ching-Hsiang ; Yaseghi, S.

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Brunel Univ., London, UK
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    688
  • Lastpage
    693
  • Abstract
    Spectral or formant discontinuities across successive speech segments are legacies of concatenative TTS synthesisers. In this paper, a pole analysis procedure is used to estimate the formant frequency, bandwidths, spectrum shape and dynamics. The obtained formant tracks are then used for formant smoothing purposes in TTS synthesis. This paper explores three methods of spectral and formants smoothing. The first method achieves spectral smoothing at segment boundaries by interpolating the LP autocorrelation vectors. The second and third formant smoothing methods involves direct modification of the formant frequencies. In the second method, smoothing is achieved by substituting the formants of the synthesised speech with that of the natural speech. Finally, the last method achieves formant smoothing by moving the formants of successive segments closer to an average value.
  • Keywords
    correlation methods; interpolation; poles and zeros; smoothing methods; speech synthesis; LP autocorrelation vector interpolation; TTS synthesis; concatenative TTS synthesisers; formant bandwidth; formant dynamics; formant frequency; formant spectrum shape; formant track estimation; formants smoothing; natural speech formant substitution; pole analysis procedure; spectral discontinuities; spectral smoothing; speech segment formant discontinuities; text to speech synthesis; Autocorrelation; Bandwidth; Concatenated codes; Costs; Databases; Frequency estimation; Interpolation; Psychoacoustic models; Smoothing methods; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318523
  • Filename
    1318523