• DocumentCode
    2744449
  • Title

    A Novel Voice Morphing System Using Bi-GMM for High Quality Transformation

  • Author

    Xu, Ning ; Shao, Xi ; Yang, Zhen

  • Author_Institution
    Inst. of Signal Process. & Transm., Nanjing Univ. of Posts & Telecommun., Nanjing
  • fYear
    2008
  • fDate
    6-8 Aug. 2008
  • Firstpage
    485
  • Lastpage
    489
  • Abstract
    This paper presents a novel voice morphing system which reproduces high quality speech while maintaining the majority of the target characteristics. Bi-GMM is named for using GMM technique to estimate mapping functions as well as a codebook generated by GMM either. Compared with the traditional GMM technique, a maximum likelihood estimation framework combined with codebook compensation technique is proposed to overcome the overly smoothed problem caused by conventional GMM. Furthermore, in order to alleviate the discontinuities between frames, a time domain median filter is applied. The STRAIGHT algorithm is adopted for the analysis and synthesis process. The objective and subjective evaluations show that the quality of the speech converted by the proposed method is significantly improved compared with the results by the traditional GMM method.
  • Keywords
    Gaussian processes; maximum likelihood estimation; speech processing; STRAIGHT algorithm; biGaussian mixture model; codebook compensation technique; high quality speech transformation; maximum likelihood estimation; time domain median filter; voice morphing system; Artificial intelligence; Artificial neural networks; Distributed computing; Feature extraction; Filters; Maximum likelihood estimation; Software engineering; Speech analysis; Speech synthesis; Target tracking; Bi-GMM; codebook; voice morphing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on
  • Conference_Location
    Phuket
  • Print_ISBN
    978-0-7695-3263-9
  • Type

    conf

  • DOI
    10.1109/SNPD.2008.8
  • Filename
    4617418