• DocumentCode
    2174503
  • Title

    High accurate model-integration-based voice conversion using dynamic features and model structure optimization

  • Author

    Saito, Daisuke ; Watanabe, Shinji ; Nakamura, Atsushi ; Minematsu, Nobuaki

  • Author_Institution
    Univ. of Tokyo, Tokyo, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4576
  • Lastpage
    4579
  • Abstract
    This paper combines a parameter generation algorithm and a model optimization approach with the model-integration-based voice con version (MIVC). We have proposed probabilistic integration of a joint density model and a speaker model to mitigate a requirement of the parallel corpus in voice conversion (VC) based on Gaussian Mixture Model (GMM). As well as the other VC methods, MIVC also suffers from the problems; the degradation of the perceptual quality caused by the discontinuity through the parameter trajectory, and the difficulty to optimize the model structure. To solve the problems, this paper proposes a parameter generation algorithm constrained by dynamic features for the first problem and an information criterion including mutual influences between the joint density model and the speaker model for the second problem. Experimental results show that the first approach improved the performance of VC and the second approach appropriately predicted the optimal number of mixtures of the speaker model for our MIVC.
  • Keywords
    Gaussian distribution; integration; speaker recognition; GMM; Gaussian mixture model; MIVC; VC methods; dynamic feature; high accurate model-integration-based voice conversion; joint density model; model structure optimization; parameter generation algorithm; perceptual quality; speaker model; Adaptation models; Data models; Equations; Joints; Mathematical model; Optimization; Speech; Voice conversion; dynamic features; information criterion; probabilistic integration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947373
  • Filename
    5947373