• DocumentCode
    1589160
  • Title

    On a pitch alteration technique of speech using the asymmetry weighted window

  • Author

    Jung, Chan- Joong ; Ham, Myung-Kyu ; Bae, Myung- Jin

  • Author_Institution
    Dept. of Inf. & Telecommun., Soongsil Univ., Seoul, South Korea
  • Volume
    2
  • fYear
    1999
  • fDate
    6/21/1905 12:00:00 AM
  • Firstpage
    1439
  • Abstract
    To use the speech as an effective communication medium between man and machine, the synthetic speech must have good quality and various voice colors. Speech synthesis coding is classified into three categories: waveform coding, source coding and hybrid coding. To obtain synthetic speech with high quality, synthesis by waveform coding is desired. However, it is difficult to alter the excitation for various voice colors in waveform coding, because it does not divide the speech into excitation and formant components. Thus it is required to alter the excitation (pitch) in waveform coding for synthesis techniques with high quality and various voice colors. This paper examines the method for both improving and indicating the problem of the PSOLA pitch alteration method. It points out the fact that the spectrum distortion appeared because the Hamming window is not appropriate to the characteristic of the glottal wave shape. Therefore the asymmetric weighted window is proposed in order to improve this defect. The experimental procedure is as follows; first, the speech is segmented by the pitch unit with the asymmetric weighted window, and then the segmented speech is synthesized. The results of an experiment with two male speakers and the two female speakers uttering the test sentences are discussed. According to the experimental results, in the case of using the asymmetric weighted window, synthesized speech of high quality with minimum spectrum distortion can be obtained from waveform coding
  • Keywords
    spectral analysis; speech coding; speech intelligibility; speech synthesis; Hamming window; PSOLA pitch alteration method; asymmetric weighted window; glottal wave shape; hybrid coding; pitch unit; segmented speech; source coding; spectrum distortion; speech coding; speech quality; synthetic speech; waveform coding; Communication effectiveness; Data analysis; Data processing; Electronic mail; Shape; Source coding; Speech analysis; Speech coding; Speech synthesis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Military Communications Conference Proceedings, 1999. MILCOM 1999. IEEE
  • Conference_Location
    Atlantic City, NJ
  • Print_ISBN
    0-7803-5538-5
  • Type

    conf

  • DOI
    10.1109/MILCOM.1999.821441
  • Filename
    821441