DocumentCode :
2957475
Title :
Dialog Act classification using acoustic and discourse information of MapTask Data
Author :
Julia, Fatema N. ; Iftekharuddin, Khan M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Memphis, Memphis, TN
fYear :
2008
fDate :
1-8 June 2008
Firstpage :
1472
Lastpage :
1479
Abstract :
In this work, we analyze both acoustic and discourse information for Dialog Act (DA) classification of HCRC MapTask dataset. We extract several different acoustic features and exploit these features in a Hidden Markov Model (HMM) to classify acoustic information. For discourse feature extraction, we propose a novel parts-of-speech (POS) tagging technique that effectively reduces the dimensionality of discourse features manyfold. To classify discourse information, we exploit two classifiers such as a HMM and a Support Vector Machine (SVM) respectively. We further obtain classifier fusion between HMM and SVM to improve discourse classification. Finally, we perform an efficient decision-level classifier fusion for both acoustic and discourse information to classify twelve different DAs in HCRC MapTask data. We obtain accuracy of rate 65.2% (58.06% with cross validation) and 55.4% (51.08% with cross validation) DA classification using acoustic and discourse information respectively. Furthermore, we obtain combined accuracy of 68.6% (61.02% with cross validation) for DA classification. These accuracy rates of DA classification are comparable to previously reported results for the same HCRC MapTask dataset. In terms of average Precision and Recall, we obtain accuracy of 74.89% and 69.83% (without cross validation) respectively. Therefore, we obtain much better precision and recall rate for most of the classified DAs when compared to existing works on the same dataset.
Keywords :
feature extraction; hidden Markov models; pattern classification; speech processing; support vector machines; MapTask data; acoustic features; acoustic information; decision-level classifier fusion; dialog act classification; discourse feature extraction; discourse information; hidden Markov model; parts-of-speech tagging technique; support vector machine; Acoustic measurements; Acoustic testing; Data mining; Emotion recognition; Feature extraction; Hidden Markov models; Humans; Information analysis; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
ISSN :
1098-7576
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
Type :
conf
DOI :
10.1109/IJCNN.2008.4633991
Filename :
4633991
Link To Document :
بازگشت