DocumentCode :
3421962
Title :
Stylization of pitch with syllable-based linear segments
Author :
Ravuri, Suman ; Ellis, Daniel P W
Author_Institution :
Dept. of Electr. Eng., Columbia Univ., New York, NY
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
3985
Lastpage :
3988
Abstract :
Fundamental frequency contours for speech, as obtained by common pitch tracking algorithms, contain a great deal of fine detail that is unlikely to hold much perceptual significance for listeners. In our experiments, a radically reduced pitch contour consisting of a single linear segment for each syllable was found to judged as equally natural as the original pitch track by listeners, based on high-quality analysis- synthesis. We describe the algorithms both for segmenting speech into syllables based on fitting Gaussians to the energy envelope, and for approximating the pitch contour by independent linear segments for each syllable. We report our web-based test in which 40 listeners compared the stylized pitch contour resyntheses to equivalent resyntheses based on the original pitch track, and also to pitch tracks stylized by the existing Momel algorithm. Listeners preferred the original pitch contour to the linear approximation in only 60% of cases, where 50% would indicate random guessing. By contrast, the original was preferred over Momel in 74% of cases.
Keywords :
Gaussian processes; approximation theory; speech processing; speech synthesis; Gaussian fitting; Momel algorithm; energy envelope; fundamental frequency contours; linear approximation; pitch stylization; speech; speech segmentation; stylized pitch contour resyntheses; syllable-based linear segments; syllables; Frequency estimation; Gaussian approximation; Information analysis; Linear approximation; Piecewise linear approximation; Software standards; Speech analysis; Speech coding; Speech processing; Testing; Piecewise linear approximation; Speech analysis; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518527
Filename :
4518527
Link To Document :
بازگشت