• DocumentCode
    2381663
  • Title

    Same but different? — Using speech signal features for comparing conversational VoIP quality studies

  • Author

    Egger, Sebastian ; Schatz, Raimund ; Schoenenberg, Katrin ; Raake, Alexander ; Kubin, Gernot

  • Author_Institution
    Telecommun. Res. Center Vienna (ftw), Vienna, Austria
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1320
  • Lastpage
    1324
  • Abstract
    In this paper we demonstrate how speech signal features can be used to detect and explain differences in human to human conversation tests. To this end, we compare the results of two conversational VoIP quality experiments designed to quantify the impact of network delay on perceived speech quality. Both studies followed the same procedures and used the same scenarios, but were conducted in two different labs. Our comparison shows that the two studies, despite having been executed correctly using the same test design, still can produce surprisingly different results regarding the users quality perception on a MOS scale. In this respect, speech signal features extracted from conversation recordings help identifying divergent participant behavior as plausible cause for such differences. Our in-depth analysis reveals how novel parameters developed by us like Intended and Unintended Interruption Rate (IIR, UIR) and the corrected Speaker Alternation Rate SARcorr can be used to successfully determine the extent to which the results of different conversational speech quality studies are directly comparable and thus eligible for pooling, or not.
  • Keywords
    Internet telephony; feature extraction; speech processing; MOS scale; SAR; VoIP quality; conversation recordings; corrected speaker alternation rate; human-to-human conversation tests; intended interruption rate; network delay; perceived speech quality; speech signal feature extraction; unintended interruption rate; user quality perception; Delay; Feature extraction; Humans; Interrupters; Reliability; Speech; Telecommunications; Conversational Interactivity; Conversational VoIP Quality Studies; Delay; Speech Signal Features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (ICC), 2012 IEEE International Conference on
  • Conference_Location
    Ottawa, ON
  • ISSN
    1550-3607
  • Print_ISBN
    978-1-4577-2052-9
  • Electronic_ISBN
    1550-3607
  • Type

    conf

  • DOI
    10.1109/ICC.2012.6364593
  • Filename
    6364593