• DocumentCode
    2769745
  • Title

    Multi-stream dialect classification using SVM-GMM hybrid classifiers

  • Author

    Chitturi, Rahul ; Hansen, John H L

  • Author_Institution
    Univ. of Texas, Dallas
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    431
  • Lastpage
    436
  • Abstract
    In this paper, we investigate two important issues that influence dialect classification: (i) exploring dialect dependent features, and (ii) an effective way of combining spectral, excitation, and vocal tract information to improve dialect classification. The motivation is that dialect dependent features such as formants, LSP (line spectral pairs) and MEPZ (MFCCs + energy + pitch) span a wider range of speech production traits and are therefore better suited than traditional MFCCs for characterizing dialects. After establishing the proposed algorithm, we compare individual performances of each feature on a corpus of three dialects of Spanish. Next, we present a method for combining these features using GMM-SVM hybrid classifiers. The final combined system achieves a 30% relative improvement in dialect classification accuracy, confirming that the proposed advances significantly outperform conventional methods for dialect classification.
  • Keywords
    Bayes methods; Gaussian processes; feature extraction; natural language processing; signal classification; speech recognition; support vector machines; Bayesian-GMM scheme; Gaussian mixture model; dialect dependent feature; multistream dialect classification; speech recognition; support vector machines; Automatic speech recognition; Computer science; Loudspeakers; Robustness; Spatial databases; Speech processing; Speech recognition; Support vector machine classification; Support vector machines; Testing; Dialect Classification; GMM; LSP; MEPZ; SVM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430151
  • Filename
    4430151