DocumentCode
667507
Title
Comparison of windowing in speech and audio coding
Author
Backstrom, Tom
Author_Institution
Int. Audio Labs. Erlangen, Erlangen, Germany
fYear
2013
fDate
20-23 Oct. 2013
Firstpage
1
Lastpage
4
Abstract
Speech and audio coding have during the last decade converged to an increasingly unified technology. This contribution discusses one of the remaining fundamental differences between speech and audio paradigms, namely, windowing of the input signal. Audio codecs generally use lapped transforms and apply a perceptual model in the transform domain, whereby temporal continuity is achieved by windowing and overlap-add. Speech codecs on the other hand achieve temporal continuity by using linear predictive filtering, whereby windowing is applied in the residual domain. Despite these fundamental differences, we demonstrate that the two windowing approaches, combined with perceptual modeling, perform very similarly both in terms of perceptual quality and theoretical properties.
Keywords
audio coding; speech codecs; speech coding; audio codecs; audio coding; audio paradigms; linear predictive filtering; perceptual model; perceptual quality; residual domain; speech codecs; speech coding; speech paradigms; temporal continuity; transform domain; Audio coding; Codecs; Delays; Quantization (signal); Speech; Speech coding; Transforms; audio coding; lapped orthogonal transform; perceptual modelling; speech coding; windowing;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location
New Paltz, NY
ISSN
1931-1168
Type
conf
DOI
10.1109/WASPAA.2013.6701853
Filename
6701853
Link To Document