Title :
On the robustness of joint optimization on transducer-based decoding graphs
Author :
Abdelhamid, Abdelaziz A. ; Abdulla, Waleed H.
Author_Institution :
Electr. & Comput. Eng., Univ. of Auckland, Auckland, New Zealand
Abstract :
It is our believe that joint optimization of acoustic and language models meets the inherent correlation between them, and thus expected to achieve better recognition performance. This nice approach should be effective in achieving robust speech recognition where the testing conditions are different from those of training. The acoustic and language models are integrated together into a unified decoding graph using weighted finite state transducers. In this paper, we report experimental results of the joint optimization of acoustic and language models on the Resource Management (RM1) continuous speech recognition. The results show that the proposed joint optimization approach is effective under noisy conditions for unseen testing utterances and achieved relative word error rate reduction from 7% to 17% for different noise levels. These results emphasize our expectation about the robustness of the proposed joint optimization approach.
Keywords :
acoustic transducers; correlation methods; decoding; speech recognition; acoustic models; error rate reduction; inherent correlation; joint optimization; language models; resource management; robustness; speech recognition; transducer-based decoding graphs; weighted finite state transducers; Acoustics; Hidden Markov models; Joints; Noise; Robustness; Speech; Speech recognition; Acoustic model; discriminative training; language model; robust speech recognition;
Conference_Titel :
TENCON Spring Conference, 2013 IEEE
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4673-6347-1
DOI :
10.1109/TENCONSpring.2013.6584472