Generative Model of Voice in Noise for Structured Coding Applications

Author

Jinachitra, P. ; Smith, Jeffrey O.

Author_Institution

Center for Comput. Res. in Music & Acoust., Stanford Univ., CA, USA

Volume

1

fYear

2007

fDate

15-20 April 2007

Abstract

A generative model of a human voice is presented, based on many pseudo-physical considerations. For robustness, observation noise is also included in the model. An EM-algorithm framework for inference and learning is then described. An instance of approximate inference and subsequent learning presented allows an extraction of voice parameter which can be used for structured coding application. This set of parameters allows a great amount of compression as well as the flexibility in making modification to pitch, duration and breathiness, noise-free synthesis compared to other non-parametric approaches.

Keywords

expectation-maximisation algorithm; speech coding; speech enhancement; EM-algorithm; generative model; human voice; noise-free synthesis; observation noise; structured coding; subsequent learning; Acoustic noise; Application software; Audio coding; Filters; Gaussian noise; Human voice; Noise generators; Noise robustness; Speech enhancement; Speech synthesis; Structured coding; generative model of voice; parametric voice modeling; speech enhancement;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location

Honolulu, HI

ISSN

1520-6149

Print_ISBN

1-4244-0727-3

Type

conf

DOI

10.1109/ICASSP.2007.366671

Filename

4217071