• DocumentCode
    3272364
  • Title

    Statistical eigenvoice: speaker features within S+N framework and a way towards language-independent voice conversion

  • Author

    Huang, Feng ; Yin, Junxun

  • Author_Institution
    Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
  • fYear
    2005
  • fDate
    13-16 Dec. 2005
  • Firstpage
    33
  • Lastpage
    36
  • Abstract
    This paper presents a statistical method for speaker feature extraction and voice conversion within sinusoidal + noise (S+N) modeling framework. With fundamental researches on speaker characteristics embedded in the parameter sets of S+N model, we found the vector sets of statistical eigenvoice (SEV) and weighted statistical eigenvoice (wSEV), which are basis vectors of GMM representation, have significant properties: approximately speaker-dependent and language-independent. Piered by the feature vectors of SEV and wSEV, we address a new algorithm for context-free voice conversion. Subjective tests suggest that the SEV-based method achieves convincing results while maintaining high synthesis quality in comparison to the traditional LPC approaches.
  • Keywords
    eigenvalues and eigenfunctions; speech processing; statistical analysis; voice communication; S+N framework; language-independent voice conversion; sinusoidal + noise modeling framework; speaker feature extraction; statistical eigenvoice; weighted statistical eigenvoice; Linear predictive coding; Loudspeakers; Natural languages; Signal processing algorithms; Signal synthesis; Speech analysis; Speech processing; Speech synthesis; Statistical analysis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on
  • Print_ISBN
    0-7803-9266-3
  • Type

    conf

  • DOI
    10.1109/ISPACS.2005.1595339
  • Filename
    1595339