DocumentCode
1626645
Title
Voice conversion based on Non-negative Matrix Factorization in noisy environments
Author
Fujii, Teruya ; Aihara, Ryo ; Takashima, Ryoichi ; Takiguchi, Tetsuya ; Ariki, Yasuo
Author_Institution
Grad. Sch. of Syst. Inf., Kobe Univ., Kobe, Japan
fYear
2013
Firstpage
495
Lastpage
498
Abstract
This paper presents a voice conversion (VC) technique for noisy environments. We prepared parallel exemplars (dictionary) that consist of the source and target exemplars, which have the same texts uttered by the source and target speakers. The input source signal is decomposed into the source exemplars, noise exemplars obtained from the input signal, and their weights (activities). Then, the converted signal is obtained by calculating the linear combination of the target exemplars and the weights which are calculated using the source exemplars. In the proposed method, a Gaussian Mixture Model (GMM) -based conversion method is also applied to the feature vectors generated by the sparse coding in order to compensate a mismatch between the weights of source and target exemplars. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional method.
Keywords
Gaussian processes; dictionaries; matrix decomposition; mixture models; noise (working environment); source separation; speech enhancement; GMM based conversion method; Gaussian mixture model; VC technique; dictionary; feature vectors; linear combination; noise exemplars; noisy environments; nonnegative matrix factorization; parallel exemplars; source exemplars; source speakers; sparse coding; target exemplars; target speakers; voice conversion; Cepstrum; Dictionaries; Encoding; Feature extraction; Noise; Noise measurement; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
System Integration (SII), 2013 IEEE/SICE International Symposium on
Conference_Location
Kobe
Type
conf
DOI
10.1109/SII.2013.6776630
Filename
6776630
Link To Document