Title :
On the utilization of overshoot effects in low-delay audio coding
Author :
Härmä, Aki ; Laine, Unto K. ; Karjalainen, Matti
Author_Institution :
Lab. of Acoust. & Audio Signal Process., Helsinki Univ. of Technol., Espoo, Finland
Abstract :
In low-delay audio coding (coding delay <5 ms) there is no time for detailed spectral modeling in the case of brief percussive sounds, e.g., the castanets, and onsets of music or speech sounds. On the other hand, it is known from psychoacoustic experiments that the ear is not accurate near the onset of a wideband sound. We study the audibility of coding errors near the onsets of musical sounds in a simulated low-delay audio codec based on frequency-warped linear prediction. It is suggested that for many musical transients it is sufficient to reproduce a rough temporal and spectral envelope of the original signal during the first 5-10 ms. Preliminary listening tests support this idea. It is proposed that the overshoot effect of hearing could be utilized efficiently in enhancing the performance of a low-delay audio coding scheme
Keywords :
audio coding; codecs; coding errors; delays; hearing; music; prediction theory; transient analysis; brief percussive sounds; castanets; coding delay; coding error audibility; ear; frequency-warped linear prediction; hearing; listening tests; low-delay audio coding; music sounds; musical transients; overshoot effects; psychoacoustic experiments; simulated low-delay audio codec; spectral envelope; speech sounds; temporal envelope; wideband sound onset; Audio coding; Codecs; Delay effects; Ear; Music; Predictive models; Psychoacoustic models; Psychology; Speech coding; Wideband;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.759815