• DocumentCode
    1626645
  • Title

    Voice conversion based on Non-negative Matrix Factorization in noisy environments

  • Author

    Fujii, Teruya ; Aihara, Ryo ; Takashima, Ryoichi ; Takiguchi, Tetsuya ; Ariki, Yasuo

  • Author_Institution
    Grad. Sch. of Syst. Inf., Kobe Univ., Kobe, Japan
  • fYear
    2013
  • Firstpage
    495
  • Lastpage
    498
  • Abstract
    This paper presents a voice conversion (VC) technique for noisy environments. We prepared parallel exemplars (dictionary) that consist of the source and target exemplars, which have the same texts uttered by the source and target speakers. The input source signal is decomposed into the source exemplars, noise exemplars obtained from the input signal, and their weights (activities). Then, the converted signal is obtained by calculating the linear combination of the target exemplars and the weights which are calculated using the source exemplars. In the proposed method, a Gaussian Mixture Model (GMM) -based conversion method is also applied to the feature vectors generated by the sparse coding in order to compensate a mismatch between the weights of source and target exemplars. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional method.
  • Keywords
    Gaussian processes; dictionaries; matrix decomposition; mixture models; noise (working environment); source separation; speech enhancement; GMM based conversion method; Gaussian mixture model; VC technique; dictionary; feature vectors; linear combination; noise exemplars; noisy environments; nonnegative matrix factorization; parallel exemplars; source exemplars; source speakers; sparse coding; target exemplars; target speakers; voice conversion; Cepstrum; Dictionaries; Encoding; Feature extraction; Noise; Noise measurement; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Integration (SII), 2013 IEEE/SICE International Symposium on
  • Conference_Location
    Kobe
  • Type

    conf

  • DOI
    10.1109/SII.2013.6776630
  • Filename
    6776630