DocumentCode :
2947176
Title :
A segment based probabilistic generative model of speech
Author :
Achan, Kannan ; Roweis, Sam ; Hertzmann, Aaron ; Frey, Brendan
Author_Institution :
Dept. of Comput. Sci. and ECE, Toronto Univ., Ont., Canada
Volume :
5
fYear :
2005
fDate :
18-23 March 2005
Abstract :
We present a purely time domain approach to speech processing which identifies waveform samples at the boundaries between glottal pulse periods (in voiced speech) or at the boundaries of unvoiced segments. An efficient algorithm for inferring these boundaries and estimating the average spectra of voiced and unvoiced regions is derived from a simple probabilistic generative model. Competitive results are presented on pitch tracking, voiced/unvoiced detection and timescale modification; all these tasks and several others can be performed using the single segmentation provided by inference in the model.
Keywords :
inference mechanisms; probability; speech processing; time-domain analysis; average spectra estimation; glottal pulse period boundaries; glottal pulse periods; model inference; pitch tracking; probabilistic generative model; segment based probabilistic generative speech model; single segmentation; speech processing; time domain approach; timescale modification; unvoiced regions; unvoiced segment boundaries; voiced regions; voiced speech; voiced/unvoiced detection; waveform samples; Computer science; Filter bank; Hidden Markov models; Pulse shaping methods; Signal processing; Spectral analysis; Spectral shape; Speech enhancement; Speech processing; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8874-7
Type :
conf
DOI :
10.1109/ICASSP.2005.1416280
Filename :
1416280
Link To Document :
بازگشت