Title :
Generative Model of Voice in Noise for Structured Coding Applications
Author :
Jinachitra, P. ; Smith, Jeffrey O.
Author_Institution :
Center for Comput. Res. in Music & Acoust., Stanford Univ., CA, USA
Abstract :
A generative model of a human voice is presented, based on many pseudo-physical considerations. For robustness, observation noise is also included in the model. An EM-algorithm framework for inference and learning is then described. An instance of approximate inference and subsequent learning presented allows an extraction of voice parameter which can be used for structured coding application. This set of parameters allows a great amount of compression as well as the flexibility in making modification to pitch, duration and breathiness, noise-free synthesis compared to other non-parametric approaches.
Keywords :
expectation-maximisation algorithm; speech coding; speech enhancement; EM-algorithm; generative model; human voice; noise-free synthesis; observation noise; structured coding; subsequent learning; Acoustic noise; Application software; Audio coding; Filters; Gaussian noise; Human voice; Noise generators; Noise robustness; Speech enhancement; Speech synthesis; Structured coding; generative model of voice; parametric voice modeling; speech enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366671