• DocumentCode
    2909091
  • Title
    Voice Synchronization across Heterogeneous Telephony Systems: Problem and Solutions
  • Author
    Lin, Hsiao-Pu; Hsieh, Hung-Yun
  • Author_Institution
    Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2010
  • fDate
    23-27 May 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    As IP telephony gains popularity, interworking with conventional PSTN telephony has also gained importance. In particular, an increasing number of new telephony services now involve both packet-switched (IP telephony) and circuit-switched (PSTN telephony) voice legs in one call session. One common problem that arises in enabling such services is the need to synchronize voice streams that traverse heterogeneous telephony systems. In this paper, we first identify the key role of voice synchronization across heterogeneous telephony systems for services such as seamless handover between WLAN and cellular networks and multi-party audio conferencing with video overlay. We then explain the challenges in synchronizing circuit-switched and packet-switched voice streams, including codec distortion, packet loss, line noise, and overlapping utterances. To achieve voice synchronization, we investigate three approaches based on digital speech processing techniques in the waveform, cepstrum, and spectrum domains. Finally, we compare the performance benefits and tradeoffs of the three approaches, thus motivating further research in this direction.
  • Keywords
    IP networks; Internet telephony; cellular radio; cepstral analysis; codecs; speech processing; teleconferencing; wireless LAN; IP telephony; PSTN telephony; WLAN; cellular networks; cepstrum domain; circuit-switched voice streams; codec distortion; digital speech processing techniques; heterogeneous telephony systems; line noises; multiparty audio conferencing; overlapping utterances; packet losses; packet-switched voice streams; spectrum domain; telephony services; video overlay; voice synchronization; waveform domain; Cepstrum; Circuit noise; Codecs; Land mobile radio cellular systems; Leg; Speech processing; Streaming media; Telephony; Videoconference; Wireless LAN;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications (ICC), 2010 IEEE International Conference on
  • Conference_Location
    Cape Town
  • ISSN
    1550-3607
  • Print_ISBN
    978-1-4244-6402-9
  • Type
    conf
  • DOI
    10.1109/ICC.2010.5502433
  • Filename
    5502433