• DocumentCode
    2358754
  • Title

    Speaker discrimination in a conversation

  • Author

    Brümmer, J. N L

  • Author_Institution
    Datafusion Systems, Stellenbosch, South Africa
  • fYear
    1993
  • fDate
    34187
  • Firstpage
    156
  • Lastpage
    161
  • Abstract
    A method to automatically discriminate between speakers in a conversation between two people, with no prior training have been developed. It is intended as a preprocessing stage in a speaker recognition system. The speech is preprocessed by extracting the syllable nuclei and discarding fricatives and noise. Next, a multidimensional feature set is calculated for the whole conversation and an axis in the feature space is obtained that gives good discrimination between the two speakers. Speech that reaches either extreme on this axis is choosen as belonging to the two speakers respectively. The axis is found by taking a moving average of the feature vectors over a short time interval. This has the effect of reducing the variance in all directions in the feature space. Because the averaging relatively rarely goes over two speakers, the variance changes least in the direction in the direction separating the means of the speakers. This direction of least change of variance is found by eigenvector analysis of the covariance matrices before and after averaging
  • Keywords
    speaker recognition; speech processing; averaging; conversation; covariance matrices; eigenvector analysis; feature space; moving average; multidimensional feature set; speaker discrimination; speaker recognition system; syllable nuclei extraction; variance; Analysis of variance; Covariance matrix; Data mining; Error analysis; Filters; Multidimensional systems; Speaker recognition; Speech enhancement; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Signal Processing, 1993., Proceedings of the 1993 IEEE South African Symposium on
  • Conference_Location
    Jan Smuts Airport
  • Print_ISBN
    0-7803-1292-9
  • Type

    conf

  • DOI
    10.1109/COMSIG.1993.365853
  • Filename
    365853