Title :
Call-type classification and unsupervised training for the call center domain
Author :
Min Tang ; Pellom, Bryan ; Hacioglu, K.
Author_Institution :
Center for Spoken Language Res., Colorado Univ., Boulder, CO, USA
fDate :
30 Nov.-3 Dec. 2003
Abstract :
We describe recent experiments in call-type classification and acoustic modeling for speech recognition in the call center domain. We first describe the CU Call Center Corpus, a database of human-to-human conversations recorded from an information technology (IT) help desk call center located on the University of Colorado campus. Next, we describe our analysis and labeling of the recorded conversations into a hierarchical taxonomy of the call types. We consider four methods for call-type classification and provide initial experiments illustrating classification error rates for this new task domain. It is shown that lightly supervised training based on using the output from an automatic speech recognizer in conjunction with supervised labeling of calls by call-type can substantially reduce classification error rates and development efforts when only limited training data are available. A call-type classification error rate of 24% is achieved using a classifier based on support vector machines. Finally, we consider issues related to unsupervised acoustic and language model training for improved call transcription and point to directions for future work.
Keywords :
call centres; error statistics; natural language interfaces; pattern classification; speech recognition; speech-based user interfaces; support vector machines; unsupervised learning; acoustic model training; automatic speech recognition; call center; call transcription; call-type classification; classification error rates; hierarchical taxonomy; human-to-human conversations; information technology help desk; language model training; lightly supervised training; support vector machines; unsupervised training; Automatic speech recognition; Databases; Error analysis; Information technology; Labeling; Speech recognition; Support vector machine classification; Support vector machines; Taxonomy; Training data;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318429