Jointly optimal time segmentation, component selection and quantization for sinusoidal coding of audio and speech

Author

Heusdens, Richard ; Jensen, Jesper

Author_Institution

Dept. of Mediamatics, Delft Univ. of Technol., Netherlands

Volume

3

fYear

2005

fDate

18-23 March 2005

Abstract

We propose a rate-distortion optimal algorithm for sinusoidal modeling of audio and speech. The algorithm determines, for a pre-specified target bit-rate, the optimal (variable-length) time segmentation, the optimal distribution of sinusoidal components over the segments and the optimal (scalar) quantizers for quantizing the sinusoid parameters. The optimization is done by jointly optimizing the segment lengths, number of sinusoids and quantizers using high-resolution quantization theory and dynamic programming techniques, which makes it possible to solve the algorithm in polynomial time. A particular advantage of the proposed method is that, given a target bit-rate, it solves the problem of finding the optimal balance between total number of sinusoids and number of bits per sinusoid.

Keywords

audio coding; dynamic programming; quantisation (signal); rate distortion theory; speech coding; dynamic programming; high-resolution quantization theory; optimal quantization; optimal time segmentation; optimization; polynomial time; rate-distortion optimal algorithm; sinusoidal audio coding; sinusoidal component selection; sinusoidal speech coding; variable-length time segmentation; Auditory system; Dynamic programming; Entropy; Frequency; Humans; Polynomials; Quantization; Rate distortion theory; Signal analysis; Speech coding;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-8874-7

Type

conf

DOI

10.1109/ICASSP.2005.1415679

Filename

1415679