DocumentCode :
783583
Title :
Digital audio coding for visual communications
Author :
Noll, Peter
Author_Institution :
Inst. für Fernmeldetechnik, Tech. Univ. Berlin, Germany
Volume :
83
Issue :
6
fYear :
1995
fDate :
6/1/1995
Firstpage :
925
Lastpage :
943
Abstract :
Current and future visual communications for applications such as broadcasting, videotelephony, video- and audiographic-conferencing, and interactive multimedia services assume a substantial audio component. Even text, graphics, fax, still images, email documents, etc. will gain from voice annotation and audio clips. A wide range of speech, wideband speech, and wideband audio coders is available for such applications. In the context of audiovisual communications, the quality of telephone-bandwidth speech is acceptable for some videotelephony and videoconferencing services. Higher bandwidths (wideband speech) may be necessary to improve the intelligibility and naturalness of speech. High-quality audio coding, including multichannel audio, will be necessary in advanced digital TV and multimedia services. This paper explains basic approaches to speech, wideband speech, and audio bit rate compression in audiovisual communications. These signal classes differ in bandwidth, dynamic range, and listener expectation of offered quality. It will become obvious that using our knowledge of auditory perception helps minimize the perception of coding artifacts and leads to efficient low bit rate coding algorithms that can achieve substantially more compression than was thought possible only a few years ago. The paper concentrates on worldwide source coding standards beneficial for consumers, service providers, and manufacturers.
Keywords :
audio coding; audio-visual systems; code standards; data compression; digital communication; hearing; reviews; source coding; speech coding; telecommunication standards; transform coding; visual communication; advanced digital TV; audio bit rate compressions; audio clips; audiographic conferencing; audiovisual communications; auditory perception; broadcasting videotelephony; digital audio coding; dynamic range; intelligibility; interactive multimedia services; low bit rate coding algorithms; multichannel audio; speech naturalness; telephone-bandwidth speech; videoconferencing; visual communications; voice annotation; wideband speech; worldwide source coding standards; Audio coding; Bandwidth; Bit rate; Digital audio broadcasting; Digital multimedia broadcasting; Digital video broadcasting; Multimedia communication; Speech; Visual communication; Wideband;
fLanguage :
English
Journal_Title :
Proceedings of the IEEE
Publisher :
IEEE
ISSN :
0018-9219
Type :
jour
DOI :
10.1109/5.387093
Filename :
387093