DocumentCode
2801915
Title
Sound texture modelling with linear prediction in both time and frequency domains
Author
Athineos, M. ; Ellis, Daniel P. W.
Author_Institution
Columbia Univ., New York, NY, USA
fYear
2003
fDate
19-22 Oct. 2003
Firstpage
50
Abstract
Summary form only given. Sound textures, for instance a crackling fire, running water, or applause, constitute a large and largely neglected class of audio signals. Whereas tonal sounds have been effectively and flexibly modelled with sinusoids, aperiodic energy is usually modelled as white noise filtered to match the approximate spectrum of the original over 10-30 ms windows, which fails to provide a perceptually satisfying reproduction of many real-world noisy sound textures. We attribute this failure to the loss of short-term temporal structure, and we introduce a second modelling stage in which the time envelope of the residual from conventional linear predictive modelling is itself modelled with linear prediction in the spectral domain. This cascaded time- and frequency-domain linear prediction (CTFLP) leads to noise-excited resyntheses that have high perceptual fidelity. We perform a novel quantitative error analysis by measuring the proportional error within time-frequency cells across a range of timescales.
Keywords
audio signal processing; error analysis; prediction theory; signal synthesis; sound reproduction; spectral-domain analysis; time-frequency analysis; 10 to 30 ms; applause; audio signals; cascaded time/frequency-domain linear prediction; crackling fire; error analysis; frequency domain; linear predictive modelling; noise-excited resyntheses; running water; sound texture modelling; sound texture reproduction; spectral domain; time domain; time-frequency cells; tonal sounds; white noise; Predictive models;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on.
Print_ISBN
0-7803-7850-4
Type
conf
DOI
10.1109/ASPAA.2003.1285816
Filename
1285816
Link To Document