DocumentCode :
3327087
Title :
Enhancement of mismatched conditions in speaker recognition for multimedia applications
Author :
Fakhr, Waleed ; AbdelSalam, Ahmed ; Hamdy, Nadder
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
The paper investigates the performance of an HMM-based text-independent speaker recognition system under different model and feature combinations for matched and mismatched speech coding conditions. The effects of changing the HMM topology and acoustic features is first investigated. Training and testing the models using only the voiced segments of the samples is then considered. The best model structure in each topology is then used to test the effects of speech codecs like G729 at 8 kb/s and G723.1 at 5.3 and 6.3 kb/s, used in multimedia applications, on the performance of both matched and mismatched conditions. To improve the performance in mismatched conditions, a MAP-based adaptation with different amounts of coded training data and a diagonal affine transform for adapting the coded cepstral features to the original PCM cepstral features are investigated. Results show that the proposed techniques improve speaker recognition performance and produce comparable results to the matched condition test.
Keywords :
cepstral analysis; hidden Markov models; multimedia systems; pulse code modulation; speaker recognition; speech coding; transforms; 5.3 to 8 kbit/s; G723.1; G729; HMM topology; PCM; acoustic features; cepstral features; diagonal affine transform; mismatched conditions; multimedia applications; speaker recognition; speech codecs; speech coding conditions; text-independent speaker recognition system; training data; Acoustic testing; Cepstral analysis; Hidden Markov models; Multimedia systems; Phase change materials; Speaker recognition; Speech codecs; Speech coding; Speech recognition; Topology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326001
Filename :
1326001
Link To Document :
بازگشت