DocumentCode
761426
Title
A Comparison of Four Methods for Analog Speech Privacy
Author
Jayant, Nuggehally S. ; McDermott, Barbara J. ; Christensen, Susan W. ; Quinn, Ann Marie S
Author_Institution
Bell Labs., Murray Hill, NJ
Volume
29
Issue
1
fYear
1981
fDate
1/1/1981 12:00:00 AM
Firstpage
18
Lastpage
23
Abstract
Four well-known procedures for analog speech privacy have been compared in terms of residual intelligibility, bandwidth expansion, and encoding delay. Intelligibility scores have been determined from a perceptual experiment where about 70 untrained listeners were given the task of recognizing each of 200 spoken digits that occurred in a balanced set of 50 encrypted four-digit utterances, and by averaging resulting probabilities of correct digit recognition. Bandwidth expansion has been expressed in terms of a new segmental measure that is more sensitive to short-time bandwidth manipulations than a conventional, long-time-averaged power spectrum measurement. Encoding delay is a straightforward function of analog scrambler parameters. The scrambling procedures that have been compared are sample permutation (
), block permutation (
), frequency inversion (
), and a combination of methods
and
, denoted by [
]. Sample permutations involved a contiguous set of LS (2 to 128) 8 kHz samples, while block permutations operated on a contiguous set of NB (4 to 128) speech segments each of which was LB (8 to 256) samples long. Frequency inversion is obtained by simply inverting the sign of every other Nyquist (8 kHz) sample. The parameters,
, and LB , determine residual intelligibility as well as transmission properties such as encoding delay and bandwidth. The comparisons in our study provide a quantitative justification for the popular approach [
]. For example, with
and
, although the encoding delay is as much as 128 ms, the bandwidth expansion is only about 100 Hz (using the new segmental measure), and the digit intelligibility
is 20 percent. Note that in the specific problem of recognizing ten digits, purely random (input-independent) listener responses correspond to
percent.
), block permutation (
), frequency inversion (
), and a combination of methods
and
, denoted by [
]. Sample permutations involved a contiguous set of L
, and L
]. For example, with
and
, although the encoding delay is as much as 128 ms, the bandwidth expansion is only about 100 Hz (using the new segmental measure), and the digit intelligibility
is 20 percent. Note that in the specific problem of recognizing ten digits, purely random (input-independent) listener responses correspond to
percent.Keywords
Communication system privacy; Speech transmission; Bandwidth; Communications Society; Cryptography; Data communication; Delay; Encoding; Frequency; Power measurement; Privacy; Speech;
fLanguage
English
Journal_Title
Communications, IEEE Transactions on
Publisher
ieee
ISSN
0090-6778
Type
jour
DOI
10.1109/TCOM.1981.1094870
Filename
1094870
Link To Document