DocumentCode
3325546
Title
High quality voice morphing
Author
Ye, Hui ; Young, Steve
Author_Institution
Eng. Dept., Cambridge Univ., UK
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Voice morphing is a technique for modifying a source speaker´s speech to sound as if it was spoken by some designated target speaker. Most of the recent approaches to voice morphing apply a linear transformation to the spectral envelope and pitch scaling to modify the prosody. Whilst these methods are effective, they also introduce artifacts arising from the effects of glottal coupling, phase incoherence, unnatural phase dispersion and the high spectral variance of unvoiced sounds. A practical voice morphing system must account for these if high audio quality is to be preserved. This paper describes a complete voice morphing system and the enhancements needed for dealing with the various artifacts, including a novel method for synthesising natural phase dispersion. Each technique is assessed individually and the overall performance of the system evaluated using listening tests. Overall it is found that the enhancements significantly improve speaker identification scores and perceived audio quality.
Keywords
phase estimation; speaker recognition; spectral analysis; speech enhancement; speech synthesis; enhancements; glottal coupling; high quality voice morphing; listening tests; natural phase dispersion synthesis; perceived audio quality; performance; phase incoherence; speaker identification; spectral variance; unnatural phase dispersion; unvoiced sounds; Acoustical engineering; Buildings; Degradation; Design engineering; Dispersion; Loudspeakers; Shape; Speech analysis; Speech synthesis; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325909
Filename
1325909
Link To Document