Title :
Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization
Author :
Aihara, Ryo ; Takashima, Ryoichi ; Takiguchi, Tetsuya ; Ariki, Yasuo
Author_Institution :
Grad. Sch. of Syst. Inf., Kobe Univ., Kobe, Japan
Abstract :
We present in this paper a voice conversion (VC) method for a person with an articulation disorder resulting from athetoid cerebral palsy. The movement of such speakers is limited by their athetoid symptoms, and their consonants are often unstable or unclear, which makes it difficult for them to communicate. In this paper, exemplar-based spectral conversion using non-negative matrix factorization (NMF) is applied to a voice with an articulation disorder. To preserve the speaker's individuality, we use a combined dictionary that is constructed from the source speaker's vowels and the target speaker's consonants. Experimental results indicate that the performance of NMF-based VC is considerably better than that of conventional GMM-based VC.
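The exemplar-based conversion described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes frame-aligned parallel dictionaries whose columns are magnitude-spectrum exemplars, a hypothetical per-exemplar vowel/consonant label `is_vowel`, and a standard KL-divergence multiplicative update with an L1 sparsity weight. All function and parameter names are illustrative.

```python
import numpy as np

def build_combined_dictionary(A_src, A_tgt, is_vowel):
    """Hypothetical helper: keep the source speaker's exemplar where the
    column is labelled a vowel, otherwise take the target speaker's exemplar.
    A_src, A_tgt: (n_bins, n_exemplars); is_vowel: boolean (n_exemplars,)."""
    return np.where(is_vowel[None, :], A_src, A_tgt)

def estimate_activations(X, A, n_iter=200, sparsity=0.1, eps=1e-12):
    """Estimate non-negative activations H with X ~= A @ H, keeping the
    dictionary A fixed (multiplicative update for KL divergence + L1)."""
    rng = np.random.default_rng(0)
    H = rng.random((A.shape[1], X.shape[1])) + eps
    col_sums = A.sum(axis=0)[:, None]  # A^T 1, one value per exemplar
    for _ in range(n_iter):
        ratio = X / (A @ H + eps)
        H *= (A.T @ ratio) / (col_sums + sparsity)
    return H

def convert(X_src, A_src, A_out, **kwargs):
    """Decompose the input spectra on the source dictionary, then rebuild
    them with the same activations on the paired output dictionary."""
    H = estimate_activations(X_src, A_src, **kwargs)
    return A_out @ H

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    n_bins, n_exemplars, n_frames = 20, 8, 5
    A_src = rng.random((n_bins, n_exemplars)) + 0.1   # synthetic data only
    A_tgt = rng.random((n_bins, n_exemplars)) + 0.1
    is_vowel = np.array([True, False] * 4)            # assumed labels
    A_comb = build_combined_dictionary(A_src, A_tgt, is_vowel)
    X = A_src @ rng.random((n_exemplars, n_frames))
    print(convert(X, A_src, A_comb).shape)
```

Because the activations are shared between the two dictionaries, vowel exemplars are reproduced with the source speaker's own spectra while consonant exemplars are replaced by the target speaker's, which is the intuition behind the individuality-preserving combined dictionary.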
Keywords :
matrix decomposition; speech enhancement; NMF-based VC; articulation disorders; athetoid cerebral palsy; athetoid symptoms; combined dictionary; exemplar-based spectral conversion; individuality-preserving voice conversion; nonnegative matrix factorization; Dictionaries; Feature extraction; Matrix converters; Speech; Speech synthesis; Training; Articulation Disorders; Assistive Technologies; NMF; Voice Conversion; Voice Reconstruction
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639230