Title :
Activity-mapping non-negative matrix factorization for exemplar-based voice conversion
Author :
Aihara, Ryo ; Takiguchi, Tetsuya ; Ariki, Yasuo
Author_Institution :
Grad. Sch. of Syst. Inf., Kobe Univ., Kobe, Japan
Abstract :
Voice conversion (VC) is being widely researched in the field of speech processing because of increased interest in using such processing in applications such as personalized Text-To-Speech systems. We present in this paper an exemplar-based VC method using Non-negative Matrix Factorization (NMF), which is different from conventional statistical VC. In our previous exemplar-based VC method, input speech is represented by the source dictionary and its sparse coefficients. The source and the target dictionaries are fully coupled and the converted voice is constructed from the source coefficients and the target dictionary. In this paper, we propose an Activity-mapping NMF approach and introduce mapping matrices between source and target sparse coefficients. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional Gaussian Mixture Model (GMM)-based method and a conventional NMF-based method.
Keywords :
compressed sensing; matrix decomposition; signal representation; speech processing; GMM-based method; Gaussian mixture model-based method; activity-mapping NMF approach; converted voice; exemplar-based VC method; mapping matrices; nonnegative matrix factorization; personalized text-to-speech systems; source coefficients; source dictionary; sparse coefficients; speech processing; target dictionary; voice conversion; Dictionaries; Estimation; Gaussian mixture model; Matrix converters; Sparse matrices; Speech; Training data; NMF; non-negative matrix factorization; sparse representation; voice conversion;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178902