DocumentCode
3498410
Title
Probabilistic Approach for Speaker Transformation
Author
Gao Yin-qiu ; Yang Zhen
Author_Institution
Inst. of Signal & Inf. Process., Nanjing Univ. of Posts & Telecommun., Nanjing
fYear
2007
fDate
21-25 Sept. 2007
Firstpage
2845
Lastpage
2848
Abstract
A probabilistic approach of speaker transformation is proposed in this paper to make the speech of a source speaker sound like uttered by a target speaker. Speaker individuality transformation is achieved by altering characteristics of the speech spectrum and the supersegmental information such as fundamental pitch frequency. The main advantage of this scheme lies in the aspect of not only having considered the statistical property of both the source and target speech spectrum but also the relationship between them under a cross correlational model. And to make sure that the transformed speech signals are perceptually closer to the target speaker, prosody modification is also involved. The proposed scheme is evaluated using both subjective and objective measures. The experimental results show that the transformation system put forward is capable of effectively transforming speaker identity whilst the converted speech maintains high quality. And the whole performance is evaluated to be superior to the conventional vector quantization (VQ) based method.
Keywords
probability; speech processing; conventional vector quantization; probabilistic approach; speaker transformation; speech spectrum; supersegmental information; Frequency; Information processing; Interpolation; Loudspeakers; Oral communication; Robustness; Signal processing; Speech enhancement; Speech synthesis; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless Communications, Networking and Mobile Computing, 2007. WiCom 2007. International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-1311-9
Type
conf
DOI
10.1109/WICOM.2007.706
Filename
4340481
Link To Document