Title :
Call classification for automated troubleshooting on large corpora
Author :
Evanini, Keelan ; Suendermann, David ; Pieraccini, Roberto
Author_Institution :
Univ. of Pennsylvania, Philadelphia
Abstract :
This paper compares six algorithms for call classification in the framework of a dialog system for automated troubleshooting. The comparison is carried out on large datasets, each consisting of over 100,000 utterances from two domains: television (TV) and Internet (INT). In spite of the high number of classes (79 for TV and 58 for INT), the best classifier (maximum entropy on word bigrams) achieved more than 77% classification accuracy on the TV dataset and 81% on the INT dataset.
Keywords :
entropy; interactive systems; pattern classification; automated large corpora troubleshooting; call classification; dialog system; maximum entropy approach; Boosting; Cities and towns; Entropy; Hardware; Internet; Machine learning algorithms; Natural language processing; Problem-solving; Statistics; TV; automated troubleshooting; call classification; large corpora;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430110