• DocumentCode
    676835
  • Title

    Speaker change detection - an comparative study using support vector machines

  • Author

    Britto, J. Gladson Maria ; Kumar, Sahoo Subhendu

  • Author_Institution
    James Coll. of Eng. & Technol., Nagercoil, India
  • fYear
    2012
  • fDate
    27-29 Dec. 2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Speaker change detection is important for automatic segmentation of multi speaker speech data into homogeneous segments with each segment containing the data of one speaker only. Existing approaches for speaker change detection are based on the dissimilarities of the distribution of the data before and after a speaker change point. In this paper, we propose a classification based technique for speaker change detection. Patterns extracted from the data around the speaker change points are used as positive examples. Patterns extracted from the data between the speaker change points are used as negative examples. The positive and negative examples are used in training a Support Vector Machine (SVM) for speaker change detection. The trained SVM is used to scan the continuous speech signal of multispeaker data and hypothesis the points of speaker change. The extraction of fixed length patterns from speaker are given as input to the Support Vector Machine. SVMs are used to classify the speaker change points and speaker no change points using speaker features. The performance of the system is evaluated for two speaker conversations. The dataset includes three conversation for each male-male, male-female, and female-female speaker conversations.
  • Keywords
    feature extraction; learning (artificial intelligence); pattern classification; speaker recognition; support vector machines; SVM training; automatic multispeaker speech data segmentation; classification based technique; continuous speech signal scanning; data distribution dissimilarities; female-female speaker conversation; fixed length pattern extraction; homogeneous segments; male-female speaker conversation; male-male speaker conversation; pattern extraction; speaker change detection; speaker change point classification; speaker features; support vector machine training; Linear Predictive Cepstral Coefficients (LPCC); Linear Predictive Coefficients (LPC); Mel-Frequency Cepstral Coefficients (MFCC); Support Vector Machines (SVM); Weighted Linear Predictive Cepstral Coefficients (WLPCC);
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Sustainable Energy and Intelligent Systems (SEISCON 2012), IET Chennai 3rd International on
  • Conference_Location
    Tiruchengode
  • Electronic_ISBN
    978-1-84919-797-7
  • Type

    conf

  • DOI
    10.1049/cp.2012.2208
  • Filename
    6719114