DocumentCode :
3584996
Title :
Voice conversion using deep neural networks with speaker-independent pre-training
Author :
Mohammadi, Seyed Hamidreza ; Kain, Alexander
Author_Institution :
Center for Spoken Language Understanding, Oregon Health & Sci. Univ., Portland, OR, USA
fYear :
2014
Firstpage :
19
Lastpage :
23
Abstract :
In this study, we trained a deep autoencoder to build compact representations of short-term spectra of multiple speakers. Using this compact representation as mapping features, we then trained an artificial neural network to predict target voice features from source voice features. Finally, we constructed a deep neural network from the trained deep autoencoder and artificial neural network weights, which were then fine-tuned using back-propagation. We compared the proposed method to existing methods using Gaussian mixture models and frame-selection. We evaluated the methods objectively, and also conducted perceptual experiments to measure both the conversion accuracy and speech quality of selected systems. The results showed that, for 70 training sentences, frame-selection performed best, regarding both accuracy and quality. When using only two training sentences, the pre-trained deep neural network performed best, regarding both accuracy and quality.
Keywords :
Gaussian processes; backpropagation; mixture models; neural nets; signal representation; speaker recognition; Gaussian mixture models; artificial neural network training; back-propagation; conversion accuracy; deep autoencoder; deep neural network; frame-selection; mapping features; source voice features; speaker short-term spectra compact representation; speaker-independent pretraining; speech quality; target voice feature prediction; voice conversion; Accuracy; Artificial neural networks; Conferences; Speech; Speech processing; Training; autoencoder; deep neural network; pre-training; voice conversion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type :
conf
DOI :
10.1109/SLT.2014.7078543
Filename :
7078543
Link To Document :
بازگشت