DocumentCode
2358754
Title
Speaker discrimination in a conversation
Author
Brümmer, J. N L
Author_Institution
Datafusion Systems, Stellenbosch, South Africa
fYear
1993
fDate
34187
Firstpage
156
Lastpage
161
Abstract
A method to automatically discriminate between speakers in a conversation between two people, with no prior training have been developed. It is intended as a preprocessing stage in a speaker recognition system. The speech is preprocessed by extracting the syllable nuclei and discarding fricatives and noise. Next, a multidimensional feature set is calculated for the whole conversation and an axis in the feature space is obtained that gives good discrimination between the two speakers. Speech that reaches either extreme on this axis is choosen as belonging to the two speakers respectively. The axis is found by taking a moving average of the feature vectors over a short time interval. This has the effect of reducing the variance in all directions in the feature space. Because the averaging relatively rarely goes over two speakers, the variance changes least in the direction in the direction separating the means of the speakers. This direction of least change of variance is found by eigenvector analysis of the covariance matrices before and after averaging
Keywords
speaker recognition; speech processing; averaging; conversation; covariance matrices; eigenvector analysis; feature space; moving average; multidimensional feature set; speaker discrimination; speaker recognition system; syllable nuclei extraction; variance; Analysis of variance; Covariance matrix; Data mining; Error analysis; Filters; Multidimensional systems; Speaker recognition; Speech enhancement; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications and Signal Processing, 1993., Proceedings of the 1993 IEEE South African Symposium on
Conference_Location
Jan Smuts Airport
Print_ISBN
0-7803-1292-9
Type
conf
DOI
10.1109/COMSIG.1993.365853
Filename
365853
Link To Document