Title :
LSF mapping for voice conversion with very small training sets
Author :
Helander, Elina ; Nurminen, Jani ; Gabbouj, Moncef
Author_Institution :
Inst. of Signal Process., Tampere Univ. of Technol., Tampere
fDate :
March 31 2008-April 4 2008
Abstract :
To make voice conversion usable in practical applications, the number of training sentences should be minimized. With traditional Gaussian mixture model (GMM) based techniques small training sets lead to over-fitting and estimation problems. We propose a new approach for mapping line spectral frequencies (LSFs) representing the vocal tract. The idea is based on inherent intra-frame correlations of LSFs. For each target LSF, a separate GMM is used and only the source and target LSF elements best correlating with the current LSF are used in training. The proposed method is evaluated both objectively and in listening tests, and it is shown that the method outperforms the conventional GMM approach especially with very small training sets.
Keywords :
speech processing; statistical analysis; line spectral frequency mapping; vocal tract representation; voice conversion; Filters; Frequency conversion; Hidden Markov models; Loudspeakers; Signal processing; Speech processing; Speech synthesis; Testing; Training data; Virtual colonoscopy; line spectral frequencies; voice conversion;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518698