DocumentCode :
2705152
Title :
Stochastic Modeling and Quantization of Harmonic Phases in Speech using Wrapped Gaussian Mixture Models
Author :
Agiomyrgiannakis, Yannis ; Stylianou, Yannis
Author_Institution :
Dept. of Comput. Sci., Crete Univ., Greece
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Harmonic sinusoidal representations of speech have proven to be useful in many speech processing tasks. This work focuses on the phase spectra of the harmonics and provides a methodology to analyze and subsequently model the statistics of the harmonic phases. To do so, we propose the use of a wrapped Gaussian mixture model (WGMM), a model suitable for random variables that belong to circular spaces, and provide an expectation-maximization algorithm for training. The WGMM is then used to construct a phase quantizer. The quantizer is employed in a prototype variable-rate narrowband VoIP sinusoidal codec that is equivalent to iLBC in terms of PESQ-MOS at approximately 13 kbps.
Keywords :
Gaussian processes; expectation-maximisation algorithm; speech coding; expectation-maximization algorithm; harmonic phases quantization; phase spectra; speech harmonic sinusoidal representations; speech processing; stochastic modeling; wrapped Gaussian mixture models; Codecs; Expectation-maximization algorithms; Harmonic analysis; Narrowband; Prototypes; Quantization; Random variables; Speech processing; Statistical analysis; Stochastic processes; phase coding; source coding; speech analysis; speech coding; transform coding;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367271
Filename :
4218302