DocumentCode :
3051323
Title :
Speaker recognition using a feature weighting technique
Author :
Ney, Hermann ; Gierloff, Rainer
Author_Institution :
Philips GmbH ForschungsLaboratorium Hamburg, F.R.G.
Volume :
7
fYear :
1982
fDate :
30072
Firstpage :
1645
Lastpage :
1648
Abstract :
This paper describes a technique for increasing the ability of a text-dependent speaker recognition system to discriminate between speaker classes; this technique is to be performed in conjunction with the nonlinear time alignment between a reference pattern and a test pattern. Unlike the standard approach, where the training of the recognition system merely consists of storing and averaging or selecting the time normalized training patterns separately for each class, the training phase of the system is extended in that a weight is determined for each individual feature component of the complete reference pattern according to the ability of the feature to distinguish between speaker classes. The weights depend on the time axis as well as on the frequency axis. The overall distance computed after nonlinear time alignment between a reference pattern and a test pattern thus becomes a function of the given set of weights of the reference class considered. For each class, the optimum weights result from the ideal criterion of minimum error rate. Instead of this criterion, the closely related but mathematically more convenient Fisher criterion is used that leads to a closed from solution for the unknown weights. Based on these weights, the selection of subsets of effective features is studied in order to further improve the class discrimination. The feature weighting and selecting techniques are tested using a data base of utterances recorded off dialed-up telephone lines. The experiments indicate that feature weighting and feature selection can reduce the error rates by a factor of two or more both for speaker identification and speaker verification.
Keywords :
Closed-form solution; Dynamic programming; Error analysis; Frequency; Pattern recognition; Speaker recognition; Speech; System testing; Telephony; Tiles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type :
conf
DOI :
10.1109/ICASSP.1982.1171489
Filename :
1171489
Link To Document :
بازگشت