مرکز منطقه ای اطلاع رساني علوم و فناوري - Dialog Act classification using acoustic and discourse information of MapTask Data

DocumentCode :

2957475

Title :

Dialog Act classification using acoustic and discourse information of MapTask Data

Author :

Julia, Fatema N. ; Iftekharuddin, Khan M.

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Memphis, Memphis, TN

fYear :

2008

fDate :

1-8 June 2008

Firstpage :

1472

Lastpage :

1479

Abstract :

In this work, we analyze both acoustic and discourse information for Dialog Act (DA) classification of HCRC MapTask dataset. We extract several different acoustic features and exploit these features in a Hidden Markov Model (HMM) to classify acoustic information. For discourse feature extraction, we propose a novel parts-of-speech (POS) tagging technique that effectively reduces the dimensionality of discourse features manyfold. To classify discourse information, we exploit two classifiers such as a HMM and a Support Vector Machine (SVM) respectively. We further obtain classifier fusion between HMM and SVM to improve discourse classification. Finally, we perform an efficient decision-level classifier fusion for both acoustic and discourse information to classify twelve different DAs in HCRC MapTask data. We obtain accuracy of rate 65.2% (58.06% with cross validation) and 55.4% (51.08% with cross validation) DA classification using acoustic and discourse information respectively. Furthermore, we obtain combined accuracy of 68.6% (61.02% with cross validation) for DA classification. These accuracy rates of DA classification are comparable to previously reported results for the same HCRC MapTask dataset. In terms of average Precision and Recall, we obtain accuracy of 74.89% and 69.83% (without cross validation) respectively. Therefore, we obtain much better precision and recall rate for most of the classified DAs when compared to existing works on the same dataset.

Keywords :

feature extraction; hidden Markov models; pattern classification; speech processing; support vector machines; MapTask data; acoustic features; acoustic information; decision-level classifier fusion; dialog act classification; discourse feature extraction; discourse information; hidden Markov model; parts-of-speech tagging technique; support vector machine; Acoustic measurements; Acoustic testing; Data mining; Emotion recognition; Feature extraction; Hidden Markov models; Humans; Information analysis; Support vector machine classification; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on

Conference_Location :

Hong Kong

ISSN :

1098-7576

Print_ISBN :

978-1-4244-1820-6

Electronic_ISBN :

1098-7576

Type :

conf

DOI :

10.1109/IJCNN.2008.4633991

Filename :

4633991

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2957475