• DocumentCode
    1758005
  • Title

    HMM Based Intermediate Matching Kernel for Classification of Sequential Patterns of Speech Using Support Vector Machines

  • Author

    Dileep, A.D. ; Sekhar, C. Chandra

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai, India
  • Volume
    21
  • Issue
    12
  • fYear
    2013
  • fDate
    Dec. 2013
  • Firstpage
    2570
  • Lastpage
    2582
  • Abstract
    In this paper, we address the issues in the design of an intermediate matching kernel (IMK) for classification of sequential patterns using support vector machine (SVM) based classifier for tasks such as speech recognition. Specifically, we address the issues in constructing a kernel for matching sequences of feature vectors extracted from the speech signal data of utterances. The codebook based IMK and Gaussian mixture model (GMM) based IMK have been proposed earlier for matching the varying length patterns represented as sets of features vectors for tasks such as image classification and speaker recognition. These methods consider the centers of clusters and the components of GMM as the virtual feature vectors used in the design of IMK. As these methods do not use sequence information in matching the patterns, these methods are not suitable for matching sequential patterns. We propose the hidden Markov model (HMM) based IMK for matching sequential patterns of varying length. We consider two approaches to design the HMM-based IMK. In the first approach, each of the two sequences to be matched is segmented into subsequences with each subsequence aligned to a state of the HMM. Then the HMM-based IMK is constructed as a combination of state-specific GMM-based IMKs that match the subsequences aligned with the particular states of the HMM. In the second approach, the HMM-based IMK is constructed without segmenting sequences, and by matching the local feature vectors selected using the responsibility terms that account for being in a state and generating the feature vectors by a component of the GMM of that state. We study the performance of the SVM based classifiers using the proposed HMM-based IMK for recognition of isolated utterances of E-set in English alphabet and recognition of consonent-vowel segments in Hindi language.
  • Keywords
    Gaussian processes; hidden Markov models; pattern classification; pattern matching; speech recognition; support vector machines; E-set; English alphabet; GMM based IMK; Gaussian mixture model based IMK; HMM based IMK; Hindi language; SVM based classifier; codebook based IMK; consonent-vowel segments; hidden Markov model based IMK; image classification; intermediate matching kernel; local feature vectors; matching sequences; sequential patterns; speaker recognition; speech recognition; speech signal data; subsequences; support vector machine based classifier; virtual feature vectors; Hidden Markov models; Kernel; Sequential analysis; Speech processing; Speech recognition; Support vector machines; Intermediate matching kernel; speech recognition; support vector machine; varying length sequences;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2279338
  • Filename
    6584766