DocumentCode
423256
Title
Prediction-based packet loss concealment for voice over IP: a statistical n-gram approach
Author
Lee, Minkyu ; Zitouni, Imed ; Zhou, Qiru
Author_Institution
Lucent Technol. Bell Labs., Murray Hill, NJ, USA
Volume
4
fYear
2004
fDate
29 Nov.-3 Dec. 2004
Firstpage
2308
Abstract
We investigate whether lost packets can be predicted for packet loss concealment using n-gram predictive models. Unlike conventional repetition-based algorithms, the proposed algorithm is based on the Shannon game, which serves as a principle for predicting the speech parameters of lost packets from previously received parameters. During the training phase, we construct statistical backoff n-gram models. In the test phase, the models are used to predict the speech parameters of lost packets. Experiments were performed on the Switchboard telephone speech database, and the proposed algorithm is compared with the conventional repetition-based algorithm. Performance is evaluated in terms of the spectral distortion between the original and the predicted (or repeated) speech. The algorithm based on the backoff n-gram models reduces the spectral distortion by 8.7% relative to the conventional repetition-based algorithm for the first lost packet following a received one. Further, it maintains an improvement of about 6.2% for up to six consecutive lost packets. In terms of perplexity of the predictive models, the backoff n-gram approach outperforms the repetition-based algorithm by 8.65%, which is very close to the improvement obtained from the spectral distortion measurement.
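The contrast the abstract draws between repetition and n-gram prediction can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the speech parameters of each packet have already been quantized to discrete symbols (for example, codebook indices), and it uses a simplified backoff that picks the most frequent continuation of the longest previously seen context, without the discounting a full statistical backoff model would apply. All function names and the toy data are hypothetical.

```python
from collections import Counter, defaultdict


def train_ngram_counts(sequences, max_order=3):
    """Count continuations for every context of length 0..max_order-1.

    Assumption: each sequence is a list of discrete symbols standing in
    for quantized per-packet speech parameters.
    """
    counts = defaultdict(Counter)
    for seq in sequences:
        for i, symbol in enumerate(seq):
            for k in range(max_order):
                if i - k < 0:
                    break
                context = tuple(seq[i - k:i])  # k preceding symbols
                counts[context][symbol] += 1
    return counts


def predict_next(counts, history, max_order=3):
    """Simplified backoff: try the longest context first, then shorter ones."""
    for k in range(min(max_order - 1, len(history)), -1, -1):
        context = tuple(history[len(history) - k:])
        if context in counts:
            return counts[context].most_common(1)[0][0]
    # No training data at all: fall back to repeating the last symbol.
    return history[-1] if history else None


def conceal(counts, received, n_lost, max_order=3):
    """Predict n_lost consecutive packets, feeding predictions back as history."""
    history = list(received)
    predicted = []
    for _ in range(n_lost):
        symbol = predict_next(counts, history, max_order)
        predicted.append(symbol)
        history.append(symbol)
    return predicted


def repeat_last(received, n_lost):
    """Repetition baseline: reuse the last received packet's symbol."""
    return [received[-1]] * n_lost


# Toy data (hypothetical): a repeating pattern of three symbols.
training = [[1, 2, 3, 1, 2, 3, 1, 2, 3]]
model = train_ngram_counts(training)
ngram_guess = conceal(model, [1, 2], n_lost=3)
repeat_guess = repeat_last([1, 2], n_lost=3)
```

On this toy pattern the n-gram sketch continues the sequence while the repetition baseline holds the last symbol. Real speech parameters are far less regular, so the sketch only illustrates the mechanism and says nothing about the reported gains.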
Keywords
Internet telephony; game theory; prediction theory; speech processing; statistical analysis; Shannon game; n-gram predictive models; prediction-based packet loss concealment; repetition-based algorithm; spectral distortion; speech parameter prediction; statistical approach; statistical backoff n-gram models; switchboard telephone speech database; voice over IP; Databases; Delay; Forward error correction; IP networks; Predictive models; Programmable control; Speech; Telecommunication traffic; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Global Telecommunications Conference, 2004. GLOBECOM '04. IEEE
Print_ISBN
0-7803-8794-5
Type
conf
DOI
10.1109/GLOCOM.2004.1378420
Filename
1378420