Double bigram-decoding in phonotactic language identification

Author

J. Navratil;W. Zuhlke

Author_Institution

Dept. of Commun. & Meas., Tech. Univ. Ilmanau, Germany

Volume

2

fYear

1997

Firstpage

1115

Abstract

In this paper a phonotactic language identification system that employs a multilingual phone-recognizer with multiple language-dependent grammars to tokenize the spoken signal into several phone-streams is described. For each stream an independent set of language models is used to compute the language scores that are subsequently processed by two classification stages. Thus, the system acquires information from both the original-label and the decoded-phone statistics. A discriminative weighting method is applied in the second stage for better distinguishing between similar languages. A modified language-bigram model, the so-called skip-gram, that allows exploiting of a wider phonotactic context without increasing the estimation costs of a standard bigram, is introduced. Measured on the NIST´95 evaluation set, the described system outperforms the state-of-the-art phonotactic components that use multiple recognizers, and is, at the same time, less computationally expensive.

Keywords

"Decoding","Natural languages","Statistics","Signal processing","Context modeling","Costs","Time measurement","Performance evaluation","Acoustic testing","Automatic testing"

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-8186-7919-0

Type

conf

DOI

10.1109/ICASSP.1997.596137

Filename

596137