DocumentCode
1758005
Title
HMM Based Intermediate Matching Kernel for Classification of Sequential Patterns of Speech Using Support Vector Machines
Author
Dileep, A.D. ; Sekhar, C. Chandra
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
Volume
21
Issue
12
fYear
2013
fDate
Dec. 2013
Firstpage
2570
Lastpage
2582
Abstract
In this paper, we address the issues in the design of an intermediate matching kernel (IMK) for classification of sequential patterns using support vector machine (SVM) based classifier for tasks such as speech recognition. Specifically, we address the issues in constructing a kernel for matching sequences of feature vectors extracted from the speech signal data of utterances. The codebook based IMK and Gaussian mixture model (GMM) based IMK have been proposed earlier for matching the varying length patterns represented as sets of features vectors for tasks such as image classification and speaker recognition. These methods consider the centers of clusters and the components of GMM as the virtual feature vectors used in the design of IMK. As these methods do not use sequence information in matching the patterns, these methods are not suitable for matching sequential patterns. We propose the hidden Markov model (HMM) based IMK for matching sequential patterns of varying length. We consider two approaches to design the HMM-based IMK. In the first approach, each of the two sequences to be matched is segmented into subsequences with each subsequence aligned to a state of the HMM. Then the HMM-based IMK is constructed as a combination of state-specific GMM-based IMKs that match the subsequences aligned with the particular states of the HMM. In the second approach, the HMM-based IMK is constructed without segmenting sequences, and by matching the local feature vectors selected using the responsibility terms that account for being in a state and generating the feature vectors by a component of the GMM of that state. We study the performance of the SVM based classifiers using the proposed HMM-based IMK for recognition of isolated utterances of E-set in English alphabet and recognition of consonent-vowel segments in Hindi language.
Keywords
Gaussian processes; hidden Markov models; pattern classification; pattern matching; speech recognition; support vector machines; E-set; English alphabet; GMM based IMK; Gaussian mixture model based IMK; HMM based IMK; Hindi language; SVM based classifier; codebook based IMK; consonent-vowel segments; hidden Markov model based IMK; image classification; intermediate matching kernel; local feature vectors; matching sequences; sequential patterns; speaker recognition; speech recognition; speech signal data; subsequences; support vector machine based classifier; virtual feature vectors; Hidden Markov models; Kernel; Sequential analysis; Speech processing; Speech recognition; Support vector machines; Intermediate matching kernel; speech recognition; support vector machine; varying length sequences;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2013.2279338
Filename
6584766
Link To Document