Title :
Use of latent words language models in ASR: A sampling-based implementation
Author :
Masumura, Ryo ; Masataki, Hirokazu ; Oba, Tomohiro ; Yoshioka, Osamu ; Takahashi, Satoshi
Author_Institution :
NTT Media Intell. Labs., NTT Corp., Tokyo, Japan
Abstract :
This paper applies the latent words language model (LWLM) to automatic speech recognition (ASR). LWLMs are trained taking into account related words, i.e., grouping of similar words in terms of meaning and syntactic role. This means, for example, if a technical word and a general word play a similar syntactic role, they are given a similar probability. This is expected that the LWLM performs robustly over multiple domains. Furthermore, we can expect that the interpolation of the LWLM and a standard n-gram LM will be effective since each of the LMs have different learning criterion. In addition, this paper also describes an approximation method of the LWLM for ASR, in which words are randomly sampled on the LWLM and then a standard word n-gram language model is trained. This enables us one-pass decoding. Our experimental results show that the LWLM performs comparable to the hierarchical Pitman-Yor language model (HPYLM) in a target domain task, and more robustly performs in out-domain tasks. Moreover, an interpolation model with the HPYLM provides a lower word error rate in all the tasks.
Keywords :
approximation theory; computational linguistics; interpolation; natural language processing; sampling methods; signal sampling; speech recognition; ASR; HPYLM; LWLM interpolation; approximation method; automatic speech recognition; hierarchical Pitman-Yor language model; latent words language models; learning criterion; n-gram LM; one-pass decoding; out-domain tasks; sampling-based method; standard word n-gram language model; target domain task; Computational modeling; Decoding; Robustness; Smoothing methods; Speech; Standards; Training; Hierarchical Pitman-Yor language model; Latent words language model; Sampling-based implementation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639313