• DocumentCode
    134218
  • Title

    Frame correlation based autoregressive GMM method for voice conversion

  • Author

    Xian Li ; Zeng-Fu Wang

  • Author_Institution
    Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    221
  • Lastpage
    225
  • Abstract
    In this paper, we present a frame correlation based autoregressive GMM method for voice conversion. In our system, the cross-frame correlation of the source feature is modeled with augmented delta features, and the cross-frame correlation of target feature is modeled by autoregressive models. The expectation maximization (EM) algorithm is used for the model training, and a maximum likelihood parameter conversion algorithm is then employed to convert the feature of a source speaker into the one of a target speaker frame by frame. This method is consistent in training and conversion by using target feature´s cross-frame correlation explicitly at both stage. The experimental results show that the proposed method has excellent performance. The test set log probability of it is higher than the GMM-DYN (GMM with dynamic features) method, and the subjective evaluation results of it are also comparable to the GMM-DYN method. Furthermore, it is much more suitable for low-latency application.
  • Keywords
    Gaussian processes; autoregressive processes; expectation-maximisation algorithm; mixture models; speech synthesis; EM algorithm; augmented delta features; cross-frame correlation; expectation maximization algorithm; frame correlation based autoregressive GMM method; maximum likelihood parameter conversion algorithm; model training; source feature; source speaker feature; target feature cross-frame correlation; target speaker; voice conversion; Correlation; Hidden Markov models; Maximum likelihood estimation; Speech; Training; Trajectory; Vectors; speech synthesis; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936610
  • Filename
    6936610