• DocumentCode
    394721
  • Title

    Sound texture modelling with linear prediction in both time and frequency domains

  • Author

    Athineos, Marios ; Ellis, Daniel P W

  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
  • Volume
    5
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    Sound textures - for instance, a crackling fire, running water, or applause - constitute a large and largely neglected class of audio signals. Whereas tonal sounds have been effectively and flexibly modelled with sinusoids, aperiodic energy is usually modelled as white noise filtered to match the approximate spectrum of the original over 10-30 ms windows, which fails to provide a perceptually satisfying reproduction of many real-world noisy sound textures. We attribute this failure to the loss of short-term temporal structure, and we introduce a second modelling stage in which the time envelope of the residual from conventional linear predictive modelling is itself modelled with linear prediction in the spectral domain. This cascade time- and frequency-domain linear prediction (CTFLP) leads to noise-excited resyntheses that have high perceptual fidelity. We perform a novel quantitative error analysis by measuring the proportional error within time-frequency cells across a range of timescales.
  • Keywords
    audio signal processing; error analysis; prediction theory; sound reproduction; spectral analysis; time-frequency analysis; aperiodic energy; audio signals; cascade time- and frequency-domain linear prediction; noise-excited resyntheses; perceptual fidelity; proportional error measurement; quantitative error analysis; short-term temporal structure; sound reproduction; sound texture modelling; spectral domain; time-frequency cells; Acoustic noise; Error analysis; Fires; Frequency domain analysis; Matched filters; Music; Performance evaluation; Predictive models; Speech; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1200054
  • Filename
    1200054