DocumentCode
2909091
Title
Voice Synchronization across Heterogeneous Telephony Systems: Problem and Solutions
Author
Lin, Hsiao-Pu ; Hsieh, Hung-Yun
Author_Institution
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2010
fDate
23-27 May 2010
Firstpage
1
Lastpage
6
Abstract
As IP telephony gains popularity, interworking with conventional PSTN telephony has become more important. In particular, an increasing number of new telephony services now involve both packet-switched (IP telephony) and circuit-switched (PSTN telephony) voice legs in one call session. A common problem in enabling such services is the need to synchronize voice streams that traverse heterogeneous telephony systems. In this paper, we first identify the key role of voice synchronization across heterogeneous telephony systems for services such as seamless handover between WLAN and cellular networks and multi-party audio conferencing with video overlay. We then explain the challenges in synchronizing circuit-switched and packet-switched voice streams, including codec distortion, packet loss, line noise, and overlapping utterances. To achieve voice synchronization, we investigate three approaches based on digital speech processing techniques in the waveform, cepstrum, and spectrum domains. Finally, we compare the performance benefits and tradeoffs of these approaches, motivating further research in this direction.
Keywords
IP networks; Internet telephony; cellular radio; cepstral analysis; codecs; speech processing; teleconferencing; wireless LAN; IP telephony; PSTN telephony; WLAN; cellular networks; cepstrum domain; circuit-switched voice streams; codec distortion; digital speech processing techniques; heterogeneous telephony systems; line noises; multiparty audio conferencing; overlapping utterances; packet losses; packet-switched voice streams; spectrum domain; telephony services; video overlay; voice synchronization; waveform domain; Cepstrum; Circuit noise; Codecs; Land mobile radio cellular systems; Leg; Speech processing; Streaming media; Telephony; Videoconference; Wireless LAN;
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications (ICC), 2010 IEEE International Conference on
Conference_Location
Cape Town
ISSN
1550-3607
Print_ISBN
978-1-4244-6402-9
Type
conf
DOI
10.1109/ICC.2010.5502433
Filename
5502433