DocumentCode
676835
Title
Speaker change detection - an comparative study using support vector machines
Author
Britto, J. Gladson Maria ; Kumar, Sahoo Subhendu
Author_Institution
James Coll. of Eng. & Technol., Nagercoil, India
fYear
2012
fDate
27-29 Dec. 2012
Firstpage
1
Lastpage
5
Abstract
Speaker change detection is important for automatic segmentation of multi speaker speech data into homogeneous segments with each segment containing the data of one speaker only. Existing approaches for speaker change detection are based on the dissimilarities of the distribution of the data before and after a speaker change point. In this paper, we propose a classification based technique for speaker change detection. Patterns extracted from the data around the speaker change points are used as positive examples. Patterns extracted from the data between the speaker change points are used as negative examples. The positive and negative examples are used in training a Support Vector Machine (SVM) for speaker change detection. The trained SVM is used to scan the continuous speech signal of multispeaker data and hypothesis the points of speaker change. The extraction of fixed length patterns from speaker are given as input to the Support Vector Machine. SVMs are used to classify the speaker change points and speaker no change points using speaker features. The performance of the system is evaluated for two speaker conversations. The dataset includes three conversation for each male-male, male-female, and female-female speaker conversations.
Keywords
feature extraction; learning (artificial intelligence); pattern classification; speaker recognition; support vector machines; SVM training; automatic multispeaker speech data segmentation; classification based technique; continuous speech signal scanning; data distribution dissimilarities; female-female speaker conversation; fixed length pattern extraction; homogeneous segments; male-female speaker conversation; male-male speaker conversation; pattern extraction; speaker change detection; speaker change point classification; speaker features; support vector machine training; Linear Predictive Cepstral Coefficients (LPCC); Linear Predictive Coefficients (LPC); Mel-Frequency Cepstral Coefficients (MFCC); Support Vector Machines (SVM); Weighted Linear Predictive Cepstral Coefficients (WLPCC);
fLanguage
English
Publisher
iet
Conference_Titel
Sustainable Energy and Intelligent Systems (SEISCON 2012), IET Chennai 3rd International on
Conference_Location
Tiruchengode
Electronic_ISBN
978-1-84919-797-7
Type
conf
DOI
10.1049/cp.2012.2208
Filename
6719114
Link To Document