DocumentCode
431644
Title
Jointly optimal time segmentation, component selection and quantization for sinusoidal coding of audio and speech
Author
Heusdens, Richard ; Jensen, Jesper
Author_Institution
Dept. of Mediamatics, Delft Univ. of Technol., Netherlands
Volume
3
fYear
2005
fDate
18-23 March 2005
Abstract
We propose a rate-distortion optimal algorithm for sinusoidal modeling of audio and speech. The algorithm determines, for a pre-specified target bit-rate, the optimal (variable-length) time segmentation, the optimal distribution of sinusoidal components over the segments and the optimal (scalar) quantizers for quantizing the sinusoid parameters. The optimization is done by jointly optimizing the segment lengths, number of sinusoids and quantizers using high-resolution quantization theory and dynamic programming techniques, which makes it possible to solve the algorithm in polynomial time. A particular advantage of the proposed method is that, given a target bit-rate, it solves the problem of finding the optimal balance between total number of sinusoids and number of bits per sinusoid.
Keywords
audio coding; dynamic programming; quantisation (signal); rate distortion theory; speech coding; dynamic programming; high-resolution quantization theory; optimal quantization; optimal time segmentation; optimization; polynomial time; rate-distortion optimal algorithm; sinusoidal audio coding; sinusoidal component selection; sinusoidal speech coding; variable-length time segmentation; Auditory system; Dynamic programming; Entropy; Frequency; Humans; Polynomials; Quantization; Rate distortion theory; Signal analysis; Speech coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8874-7
Type
conf
DOI
10.1109/ICASSP.2005.1415679
Filename
1415679
Link To Document