DocumentCode
1694002
Title
Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization
Author
Aihara, Ryo ; Takashima, Ryoichi ; Takiguchi, Tetsuya ; Ariki, Yasuo
Author_Institution
Grad. Sch. of Syst. Inf., Kobe Univ., Kobe, Japan
fYear
2013
Firstpage
8037
Lastpage
8040
Abstract
We present in this paper a voice conversion (VC) method for a person with an articulation disorder resulting from athetoid cerebral palsy. The movement of such speakers is limited by their athetoid symptoms, and their consonants are often unstable or unclear, which makes it difficult for them to communicate. In this paper, exemplar-based spectral conversion using Non-negative Matrix Factorization (NMF) is applied to a voice with an articulation disorder. To preserve the speaker´s individuality, we used a combined dictionary that is constructed from the source speaker´s vowels and target speaker´s consonants. Experimental results indicate that the performance of NMF-based VC is considerably better than conventional GMM-based VC.
Keywords
matrix decomposition; speech enhancement; NMF-based VC; articulation disorders; athetoid cerebral palsy; athetoid symptoms; combined dictionary; exemplar-based spectral conversion; individuality-preserving voice conversion; nonnegative matrix factorization; Dictionaries; Feature extraction; Matrix converters; Speech; Speech synthesis; Training; Articulation Disorders; Assistive Technologies; NMF; Voice Conversion; Voice Reconstruction;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639230
Filename
6639230
Link To Document