DocumentCode
331806
Title
Voice conversion based on static speaker characteristics
Author
Schwardt, L.C. ; du Preez, J.A.
Author_Institution
Digital Signal Process. Group, Stellenbosch Univ., South Africa
fYear
1998
fDate
7-8 Sep 1998
Firstpage
57
Lastpage
62
Abstract
Voice conversion has recently emerged as an interesting branch of speech processing that deals with the modification of a speaker's perceived identity. This technology has applications in speech recognition and in the entertainment and security industries. This paper provides a brief introduction to current voice conversion approaches and discusses the development of the PASS system, a parametric voice conversion algorithm based on static speaker characteristics. The system is easy to implement, requires no phonetic transcription of the speech data, and is shown to be valuable when very little training data is available. Particular mention is made of the pitch extraction subsystem, which uses a novel pitch determination algorithm to ensure the robust estimation of pitch statistics.
Keywords
parameter estimation; speaker recognition; speech processing; statistical analysis; PASS system; entertainment; parametric voice conversion algorithm; pitch determination algorithm; pitch extraction subsystem; pitch statistics; robust estimation; security industry; speaker identity; speech processing; speech recognition; static speaker characteristics; Data mining; Data security; Linear predictive coding; Robustness; Signal processing algorithms; Signal synthesis; Speech processing; Speech recognition; Speech synthesis; Training data;
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications and Signal Processing, 1998. COMSIG '98. Proceedings of the 1998 South African Symposium on
Conference_Location
Rondebosch
Print_ISBN
0-7803-5054-5
Type
conf
DOI
10.1109/COMSIG.1998.736922
Filename
736922