DocumentCode :
463447
Title :
Generative Model of Voice in Noise for Structured Coding Applications
Author :
Jinachitra, P. ; Smith, Jeffrey O.
Author_Institution :
Center for Comput. Res. in Music & Acoust., Stanford Univ., CA, USA
Volume :
1
fYear :
2007
fDate :
15-20 April 2007
Abstract :
A generative model of a human voice is presented, based on many pseudo-physical considerations. For robustness, observation noise is also included in the model. An EM-algorithm framework for inference and learning is then described. An instance of approximate inference and subsequent learning presented allows an extraction of voice parameter which can be used for structured coding application. This set of parameters allows a great amount of compression as well as the flexibility in making modification to pitch, duration and breathiness, noise-free synthesis compared to other non-parametric approaches.
Keywords :
expectation-maximisation algorithm; speech coding; speech enhancement; EM-algorithm; generative model; human voice; noise-free synthesis; observation noise; structured coding; subsequent learning; Acoustic noise; Application software; Audio coding; Filters; Gaussian noise; Human voice; Noise generators; Noise robustness; Speech enhancement; Speech synthesis; Structured coding; generative model of voice; parametric voice modeling; speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366671
Filename :
4217071
Link To Document :
بازگشت