DocumentCode
179044
Title
Language model adaptation for automatic call transcription
Author
Haznedaroglu, Ali ; Arslan, Levent M.
Author_Institution
Sestek, Istanbul, Turkey
fYear
2014
fDate
4-9 May 2014
Firstpage
4102
Lastpage
4106
Abstract
This paper presents a method of language model adaptation for call-center conversations using automatic speech recognition (ASR) transcripts and their confidence scores. The goal is to select the optimal adaptation set by estimating the recognition errors and minimizing the adaptation language model (LM) perplexity. ASR transcripts are ranked with respect to their confidence scores and adaptation data selection is done iteratively by filtering the most reliable transcript set that minimizes the LM perplexity. Model adaptation is then carried out by interpolating the selected adaptation LM with the baseline in-domain LM. We have evaluated our approach on agent speech of real call-center conversations and experiments show that 4% relative word error rate reduction is achieved with the proposed approach.
Keywords
call centres; interpolation; natural language processing; speech recognition; ASR transcripts; LM perplexity; adaptation data selection; adaptation language model perplexity; agent speech; automatic call transcription; automatic speech recognition transcripts; call-center conversations; confidence scores; language model adaptation; optimal adaptation set; recognition errors; word error rate reduction; Accuracy; Acoustics; Adaptation models; Data models; Hidden Markov models; Speech; Speech recognition; language model adaptation; language modeling; large vocabulary continuous speech recognition; speech analytics;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854373
Filename
6854373
Link To Document