• DocumentCode
    324227
  • Title

    Intelligent voice smoother for silence-suppressed voice over Internet

  • Author

    Tien, Po L. ; Yuang, Maria C.

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • Volume
    3
  • fYear
    1998
  • fDate
    7-11 Jun 1998
  • Firstpage
    1764
  • Abstract
    For transporting voice data with silence suppression over the Internet, the problem of jitter introduced from the network often renders the speech unintelligible. It is thus indispensable to offer intramedia synchronization to remove jitter while retaining minimal playout delay. We propose a neural-network-based intra-voice synchronization mechanism, called the intelligent voice smoother (IVoS). The IVoS is composed of three components: smoother buffer, neural network (NN) traffic predictor, and constant bit rate (CBR) enforcer. Newly arriving frames, being assumed to follow a generic Markov modulated Bernoulli process (MMBP), are queued in the smoother buffer. The NN traffic predictor employs an on-line-trained backpropagation neural network (BPNN) to predict three traffic characteristics of every newly encountered talkspurt period. Based on the predicted characteristics, the CBR enforcer derives an adaptive buffering delay. It then imposes such delay on the playout of the first frame in the talkspurt period. The CBR enforcer in turn regulates CBR-based departures for the remaining frames of the talkspurt, aiming at assuring minimal mean and variance of distortion of talkspurts (DOT) and mean playout delay (PD). Simulation results reveal that, compared to three other playout approaches, IVoS achieves superior playout yielding negligible DOT and PD irrespective of traffic variation
  • Keywords
    Internet; Markov processes; adaptive systems; backpropagation; buffer storage; delays; jitter; modulation; neural nets; synchronisation; telecommunication traffic; voice communication; Internet; MMBP; Markov modulated Bernoulli process; adaptive buffering delay; backpropagation neural network; constant bit rate enforcer; distortion of talkspurts; intelligent voice smoother; intramedia synchronization; jitter; mean playout delay; neural network traffic predictor; neural-network-based intra-voice synchronization; online-trained neural network; silence suppression; silence-suppressed voice; simulation results; smoother buffer; talkspurt period; traffic characteristics; variance; voice data transport; Backpropagation; Bit rate; Delay; IP networks; Jitter; Neural networks; Speech; Telecommunication traffic; Traffic control; US Department of Transportation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, 1998. ICC 98. Conference Record. 1998 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    0-7803-4788-9
  • Type

    conf

  • DOI
    10.1109/ICC.1998.683132
  • Filename
    683132