• DocumentCode
    761426
  • Title

    A Comparison of Four Methods for Analog Speech Privacy

  • Author

    Jayant, Nuggehally S. ; McDermott, Barbara J. ; Christensen, Susan W. ; Quinn, Ann Marie S

  • Author_Institution
    Bell Labs., Murray Hill, NJ
  • Volume
    29
  • Issue
    1
  • fYear
    1981
  • fDate
    1/1/1981 12:00:00 AM
  • Firstpage
    18
  • Lastpage
    23
  • Abstract
    Four well-known procedures for analog speech privacy have been compared in terms of residual intelligibility, bandwidth expansion, and encoding delay. Intelligibility scores have been determined from a perceptual experiment where about 70 untrained listeners were given the task of recognizing each of 200 spoken digits that occurred in a balanced set of 50 encrypted four-digit utterances, and by averaging resulting probabilities of correct digit recognition. Bandwidth expansion has been expressed in terms of a new segmental measure that is more sensitive to short-time bandwidth manipulations than a conventional, long-time-averaged power spectrum measurement. Encoding delay is a straightforward function of analog scrambler parameters. The scrambling procedures that have been compared are sample permutation ( S ), block permutation ( B ), frequency inversion ( F ), and a combination of methods B and F , denoted by [ BF ]. Sample permutations involved a contiguous set of LS(2 to 128) 8 kHz samples, while block permutations operated on a contiguous set of NB(4 to 128) speech segments each of which was LB(8 to 256) samples long. Frequency inversion is obtained by simply inverting the sign of every other Nyquist (8 kHz) sample. The parameters, L_{s},N_{B} , and LB, determine residual intelligibility as well as transmission properties such as encoding delay and bandwidth. The comparisons in our study provide a quantitative justification for the popular approach [ BF ]. For example, with N_{B} = 8 and L_{B} =128 , although the encoding delay is as much as 128 ms, the bandwidth expansion is only about 100 Hz (using the new segmental measure), and the digit intelligibility I is 20 percent. Note that in the specific problem of recognizing ten digits, purely random (input-independent) listener responses correspond to I = 10 percent.
  • Keywords
    Communication system privacy; Speech transmission; Bandwidth; Communications Society; Cryptography; Data communication; Delay; Encoding; Frequency; Power measurement; Privacy; Speech;
  • fLanguage
    English
  • Journal_Title
    Communications, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0090-6778
  • Type

    jour

  • DOI
    10.1109/TCOM.1981.1094870
  • Filename
    1094870