Title :
Detecting an imposter in telephone speech
Author :
Schalkwyk, Johan ; Barnard, Etienne ; Sachs, Jeffrey R.
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Beaverton, OR, USA
Abstract :
This paper presents initial results on imposter detection in telephone speech. The imposter detector problem is defined in terms of a real-world security problem. Perceptual studies are then presented. These studies present a good estimate on the difficulty of the task at hand; it is found that humans classify approximately 85.6% of our benchmark utterances correctly. To design an automatic imposter detector, features which elicit speaker differences are studied. A baseline system based only on 20th order Linear Predictive Coefficients (LPC) classifies 75.0% of the test set correctly. By extracting features only in vowel and semi-vowel regions, i.e. where the all-pole model of the linear predictor is most accurate, the classification performance is increased to 80.0%. Further features such as average energy and median pitch result in a correct classification rate of 83.7% comparable to the perceptual benchmarks
Keywords :
feature extraction; neural nets; prediction theory; security of data; speaker recognition; telephony; all-pole model; automatic imposter detector; average energy; baseline system; benchmark utterances classification; classification performance; classification rate; feature extraction; linear predictive coefficients; linear predictor; median pitch; neural network; perceptual studies; security problem; semi-vowels; speaker differences; telephone speech; vowel; Benchmark testing; Computer vision; Detectors; Feature extraction; Humans; Linear predictive coding; Security; Speech; System testing; Telephony;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389328