DocumentCode
309305
Title
Two 1/f fluctuations in sustained phonation and their roles on naturalness of synthetic voice
Author
Aoki, Naofumi ; Ifukube, Tohru
Author_Institution
Res. Inst. for Electron. Sci., Hokkaido Univ., Sapporo, Japan
Volume
1
fYear
1996
fDate
13-16 Oct 1996
Firstpage
311
Abstract
Two kinds of fluctuation are always observed in sustained phonation: shimmer (perturbations of glottal pulse amplitude) and jitter (perturbations of pitch period duration). Both are known as involuntary laryngeal behaviors during phonation. Our study of these phenomena revealed that, in nonpathological phonation, their characteristics are not completely random but follow a power law, widely known as 1/f^γ fluctuation. To identify the roles of 1/f^γ jitter and shimmer, a series of psychoacoustic experiments was conducted. In a paired-comparison test, each of twenty-five subjects was individually instructed to select the more human-like synthetic voice among stimuli with four different jitter patterns: Gaussian white noise, 1/f^γ fluctuation, a sum of three sinusoids, and no fluctuation. Another comparison test was conducted on stimuli with three different shimmer patterns. The results of the experiments supported the idea that both 1/f^γ jitter and shimmer are crucial cues for a synthetic voice to be perceived as natural.
Keywords
1/f noise; Gaussian noise; acoustic signal processing; jitter; speech intelligibility; speech processing; speech synthesis; time series; white noise; 1/f fluctuations; 1/f^γ fluctuation; Gaussian white noise; glottal pulse amplitude perturbations; human-like synthetic voices; involuntary laryngeal behaviors; jitter; nonpathological phonation; paired comparisons test; pitch period duration perturbations; power law; psychoacoustic experiments; shimmer; summation of three sinusoids fluctuation; sustained phonation; synthetic voice naturalness; Diseases; Electronic mail; Fluctuations; Frequency; Jitter; Psychology; Signal analysis; Speech synthesis; Testing; White noise;
fLanguage
English
Publisher
IEEE
Conference_Titel
Electronics, Circuits, and Systems, 1996. ICECS '96., Proceedings of the Third IEEE International Conference on
Conference_Location
Rodos
Print_ISBN
0-7803-3650-X
Type
conf
DOI
10.1109/ICECS.1996.582813
Filename
582813