DocumentCode :
26122
Title :
GMM-Based Intermediate Matching Kernel for Classification of Varying Length Patterns of Long Duration Speech Using Support Vector Machines
Author :
Dileep, A.D. ; Sekhar, C. Chandra
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
Volume :
25
Issue :
8
fYear :
2014
fDate :
Aug. 2014
Firstpage :
1421
Lastpage :
1432
Abstract :
Dynamic kernel (DK)-based support vector machines are used for the classification of varying length patterns. This paper explores the use of intermediate matching kernel (IMK) as a DK for classification of varying length patterns of long duration speech represented as sets of feature vectors. The main issue in construction of IMK is the choice for the set of virtual feature vectors used to select the local feature vectors for matching. This paper proposes to use components of class-independent Gaussian mixture model (CIGMM) as a representation for the set of virtual feature vectors. For every component of CIGMM, a local feature vector each from the two sets of local feature vectors that has the highest probability of belonging to that component is selected and a base kernel is computed between the selected local feature vectors. The IMK is computed as the sum of all the base kernels corresponding to different components of CIGMM. It is proposed to use the responsibility term weighted base kernels in computation of IMK to improve its discrimination ability. This paper also proposes the posterior probability weighted DKs (including the proposed IMKs) to improve their classification performance and reduce the number of support vectors. The performance of the support vector machine (SVM)-based classifiers using the proposed IMKs is studied for speech emotion recognition and speaker identification tasks and compared with that of the SVM-based classifiers using the state-of-the-art DKs.
Keywords :
Gaussian processes; feature selection; mixture models; pattern matching; probability; signal classification; speaker recognition; support vector machines; CIGMM; DK-based support vector machines; GMM-based intermediate matching kernel; IMK; SVM-based classifiers; class-independent Gaussian mixture model; classification performance; discrimination ability; dynamic kernel-based support vector machines; local feature vectors selection; long duration speech representation; posterior probability weighted DK; responsibility term weighted base kernels; speaker identification tasks; speech emotion recognition; varying length patterns classification; virtual feature vectors; Kernel; Phase shift keying; Probabilistic logic; Speech; Support vector machines; Training data; Vectors; Intermediate matching kernel (IMK); long duration speech; speaker recognition; speech emotion recognition (SER); support vector machine (SVM); varying length pattern; varying length pattern.;
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2013.2293512
Filename :
6684281
Link To Document :
بازگشت