Title :
Exploiting query click logs for utterance domain detection in spoken language understanding
Author :
Hakkani-Tür, Dilek ; Heck, Larry ; Tur, Gokhan
Author_Institution :
Speech Labs., Microsoft Res., Mountain View, CA, USA
Abstract :
In this paper, we describe methods to exploit search queries mined from search engine query logs to improve domain detection in spoken language understanding. We propose extending the label propagation algorithm, a graph-based semi-supervised learning approach, to incorporate noisy domain information estimated from search engine links the users click following their queries. The main contributions of our work are the use of search query logs for domain classification, integration of noisy supervision into the semi-supervised label propagation algorithm, and sampling of high-quality query click data by mining query logs and using classification confidence scores. We show that most semi-supervised learning methods we experimented with improve the performance of the supervised training, and the biggest improvement is achieved by label propagation that uses noisy supervision. We reduce the to error rate of domain detection by 20% relative, from 6.2% to 5.0%.
Keywords :
data mining; learning (artificial intelligence); natural language processing; query processing; search engines; speech recognition; domain classification; domain detection; graph based semisupervised learning approach; label propagation algorithm; noisy supervision; query click log; query logs mining; search engine query logs; semisupervised label propagation algorithm; spoken language understanding; utterance domain detection; Entropy; Erbium; Error analysis; Noise measurement; Search engines; Training; Web search; Spoken language understanding; domain detection; label propagation; semi-supervised learning; web search queries;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947638