مرکز منطقه ای اطلاع رساني علوم و فناوري - Use of latent words language models in ASR: A sampling-based implementation

DocumentCode :

1696201

Title :

Use of latent words language models in ASR: A sampling-based implementation

Author :

Masumura, Ryo ; Masataki, Hirokazu ; Oba, Tomohiro ; Yoshioka, Osamu ; Takahashi, Satoshi

Author_Institution :

NTT Media Intell. Labs., NTT Corp., Tokyo, Japan

fYear :

2013

Firstpage :

8445

Lastpage :

8449

Abstract :

This paper applies the latent words language model (LWLM) to automatic speech recognition (ASR). LWLMs are trained taking into account related words, i.e., grouping of similar words in terms of meaning and syntactic role. This means, for example, if a technical word and a general word play a similar syntactic role, they are given a similar probability. This is expected that the LWLM performs robustly over multiple domains. Furthermore, we can expect that the interpolation of the LWLM and a standard n-gram LM will be effective since each of the LMs have different learning criterion. In addition, this paper also describes an approximation method of the LWLM for ASR, in which words are randomly sampled on the LWLM and then a standard word n-gram language model is trained. This enables us one-pass decoding. Our experimental results show that the LWLM performs comparable to the hierarchical Pitman-Yor language model (HPYLM) in a target domain task, and more robustly performs in out-domain tasks. Moreover, an interpolation model with the HPYLM provides a lower word error rate in all the tasks.

Keywords :

approximation theory; computational linguistics; interpolation; natural language processing; sampling methods; signal sampling; speech recognition; ASR; HPYLM; LWLM interpolation; approximation method; automatic speech recognition; hierarchical Pitman-Yor language model; latent words language models; learning criterion; n-gram LM; one-pass decoding; out-domain tasks; sampling-based method; standard word n-gram language model; target domain task; Computational modeling; Decoding; Robustness; Smoothing methods; Speech; Standards; Training; Hierarchical Pitman-Yor language model; Latent words language model; Sampling-based implementation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6639313

Filename :

6639313

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1696201