مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

417151

Title :

Meta-data conditional language modeling

Author :

Bacchiani, Michiel ; Roark, Brian

Author_Institution :

AT&T Labs.-Res., USA

Volume :

fYear :

2004

fDate :

17-21 May 2004

Abstract :

Automatic speech recognition (ASR) often occurs in circumstances in which knowledge external to the speech signal, or meta-data, is given. For example, a company receiving a call from a customer might have access to a database record of that customer. Conditioning the ASR models directly on this information to improve the transcription accuracy is hampered because, generally, the meta-data takes on many values and a training corpus has little data for each meta-data condition. The paper presents an algorithm to construct language models conditioned on such metadata. It uses tree-based clustering of the the training data to derive automatically meta-data projections, useful as language model conditioning contexts. The algorithm was tested on a multiple domain voice mail transcription task. We compare the performance of an adapted system aware of the domain shift to a system that only has meta-data to infer that fact. The meta-data used were the caller ID strings associated with the voice mail messages. The meta-data adapted system matched the performance of the system adapted using the domain knowledge explicitly.

Keywords :

learning (artificial intelligence); meta data; natural languages; pattern clustering; speech recognition; trees (mathematics); voice mail; ASR; automatic speech recognition; caller ID strings; meta-data conditional language modeling; training corpus; training data; tree-based clustering; voice mail transcription; Automatic speech recognition; Clustering algorithms; Context modeling; Databases; Entropy; Loudspeakers; Robustness; Testing; Training data; Voice mail;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-8484-9

Type :

conf

DOI :

10.1109/ICASSP.2004.1325967

Filename :

1325967

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=417151