Title :
Discriminative reranking of ASR hypotheses with morpholexical and N-best-list features
Author :
Haşim Sak;Murat Saraçlar;Tunga Güngör
Author_Institution :
Department of Computer Engineering, Boğ
Abstract :
This paper explores rich morphological and novel n-best-list features for reranking automatic speech recognition hypotheses. The morpholexical features are defined over the morphological features obtained by using an n-gram language model over lexical and grammatical morphemes in the first-pass. The n-best-list features for each hypothesis are defined using that hypothesis and other alternate hypotheses in an n-best list. Our methodology is to align each hypothesis with other hypotheses one by one using minimum edit distance alignment. This gives us a set of edit operations - substitution, addition and deletion as seen in these alignments. These edit operations constitute our n-best-list features as indicator features. The reranking model is trained using a word error rate sensitive averaged perceptron algorithm introduced in this paper. The proposed methods are evaluated on a Turkish broadcast news transcription task. The baseline systems are word and statistical sub-word systems which also employ morphological features for reranking. We show that morpholexical and n-best-list features are effective in improving the accuracy of the system (0.8%).
Keywords :
"Feature extraction","Hidden Markov models","Error analysis","Vectors","Training","Acoustics","Decoding"
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Print_ISBN :
978-1-4673-0365-1
DOI :
10.1109/ASRU.2011.6163931