• DocumentCode
    1793489
  • Title

    Sequential voice conversion using grid-based approximation

  • Author

    Benisty, Hadas ; Malah, David ; Crammer, Koby

  • Author_Institution
    Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Common voice conversion methods are based on Gaussian Mixture Modeling (GMM), which requires exhaustive training (typically lasting hours), often leading to ill-conditioning if the dataset used is too small. We propose a new conversion method that is trained in seconds, using either small or large scale datasets. The proposed Grid-Based (GB) method is based on sequential Bayesian tracking, by which the conversion process is expressed as a sequential estimation problem of tracking the target spectrum based on the observed source spectrum. The converted MFCC vectors are sequentially evaluated using a weighted sum of the target training set used as grid-points. To improve the perceived quality of the synthesized signals, we use a post-processing block for enhancing the global variance. Objective and subjective evaluations show that the enhanced-GB method is comparable to classic GMM-based methods, in terms of quality, and comparable to their enhanced versions, in terms of individuality.
  • Keywords
    Gaussian processes; mixture models; sequential estimation; speech processing; Gaussian mixture modeling; grid-based approximation; sequential Bayesian tracking; sequential estimation; sequential voice conversion; Approximation methods; Bayes methods; Estimation; Speech; Target tracking; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical & Electronics Engineers in Israel (IEEEI), 2014 IEEE 28th Convention of
  • Conference_Location
    Eilat
  • Print_ISBN
    978-1-4799-5987-7
  • Type

    conf

  • DOI
    10.1109/EEEI.2014.7005872
  • Filename
    7005872