DocumentCode
2352483
Title
Modelling Speech Quality for NB and WB SILK Codec for VoIP Applications
Author
Goudarzi, Mohammad ; Sun, Lingfen ; Ifeachor, Emmanuel
Author_Institution
Sch. of Comput. & Math., Univ. of Plymouth, Plymouth, UK
fYear
2011
fDate
14-16 Sept. 2011
Firstpage
42
Lastpage
47
Abstract
In the last decade, VoIP telephony has gained tremendous popularity. Skype is one of the most successful and popular VoIP services and has inspired a new generation of VoIP and multimedia users. The SILK speech codec is the latest development by Skype; it has been integrated into the current version of Skype and is expected to be incorporated into new and emerging mobile devices, such as the iPhone, and into softphones. One of the major challenges in every VoIP service is to find an easily accessible objective quality model to predict or measure the perceived speech quality, or the degree of user satisfaction. In this paper, we present a regression-based model to quantify the speech quality of the wideband (WB) and narrowband (NB) SILK codec for VoIP applications. The developed model uses a network-level parameter (i.e., packet loss) and an application-level parameter (i.e., send bit rate) to predict the perceived voice quality in terms of the Mean Opinion Score (MOS). Subjective tests were also carried out to validate the model, and good accuracy was achieved (97% for wideband and 91% for narrowband). The developed model can be easily implemented in softphones or mobile devices to predict voice quality for the SILK codec in VoIP applications. It can also be used for real-time adaptation and control of VoIP applications, to further explore the adaptive feature of the SILK codec in future mobile devices or softphones.
Keywords
Internet telephony; mobile handsets; regression analysis; speech codecs; iPhone; NB codec; Skype; VoIP telephony; WB SILK speech codec; mean opinion score; mobile devices; narrowband SILK codec; regression-based model; soft phones; speech quality modelling; wideband SILK codec; Bit rate; Codecs; Narrowband; Niobium; Predictive models; Speech; Wideband; PESQ; PESQ-WB; SILK codec; e-model; modelling; subjective testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Next Generation Mobile Applications, Services and Technologies (NGMAST), 2011 5th International Conference on
Conference_Location
Cardiff
ISSN
2161-2889
Print_ISBN
978-1-4577-1080-3
Type
conf
DOI
10.1109/NGMAST.2011.18
Filename
6082014