• DocumentCode
    2957475
  • Title
    Dialog Act classification using acoustic and discourse information of MapTask Data
  • Author
    Julia, Fatema N. ; Iftekharuddin, Khan M.
  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Memphis, Memphis, TN
  • fYear
    2008
  • fDate
    1-8 June 2008
  • Firstpage
    1472
  • Lastpage
    1479
  • Abstract
    In this work, we analyze both acoustic and discourse information for Dialog Act (DA) classification of the HCRC MapTask dataset. We extract several acoustic features and use them in a Hidden Markov Model (HMM) to classify acoustic information. For discourse feature extraction, we propose a novel parts-of-speech (POS) tagging technique that reduces the dimensionality of the discourse features manyfold. To classify discourse information, we use two classifiers, an HMM and a Support Vector Machine (SVM), and then fuse the two to improve discourse classification. Finally, we perform an efficient decision-level classifier fusion of the acoustic and discourse information to classify twelve different DAs in the HCRC MapTask data. We obtain DA classification accuracy rates of 65.2% (58.06% with cross validation) using acoustic information and 55.4% (51.08% with cross validation) using discourse information. The combined accuracy is 68.6% (61.02% with cross validation). These accuracy rates are comparable to previously reported results for the same HCRC MapTask dataset. Without cross validation, we obtain average precision and recall of 74.89% and 69.83%, respectively. These precision and recall rates are much better for most of the classified DAs than those reported in existing works on the same dataset.
  • Keywords
    feature extraction; hidden Markov models; pattern classification; speech processing; support vector machines; MapTask data; acoustic features; acoustic information; decision-level classifier fusion; dialog act classification; discourse feature extraction; discourse information; hidden Markov model; parts-of-speech tagging technique; support vector machine; Acoustic measurements; Acoustic testing; Data mining; Emotion recognition; Feature extraction; Hidden Markov models; Humans; Information analysis; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-1820-6
  • Electronic_ISBN
    1098-7576
  • Type
    conf
  • DOI
    10.1109/IJCNN.2008.4633991
  • Filename
    4633991