DocumentCode
3642469
Title
Double bigram-decoding in phonotactic language identification
Author
J. Navratil;W. Zuhlke
Author_Institution
Dept. of Commun. & Meas., Tech. Univ. Ilmenau, Germany
Volume
2
fYear
1997
Firstpage
1115
Abstract
This paper describes a phonotactic language identification system that employs a multilingual phone recognizer with multiple language-dependent grammars to tokenize the spoken signal into several phone streams. For each stream, an independent set of language models computes the language scores, which are subsequently processed by two classification stages. Thus, the system acquires information from both the original-label and the decoded-phone statistics. A discriminative weighting method is applied in the second stage to better distinguish between similar languages. A modified language-bigram model, the so-called skip-gram, is introduced; it exploits a wider phonotactic context without increasing the estimation cost of a standard bigram. Measured on the NIST'95 evaluation set, the described system outperforms the state-of-the-art phonotactic components that use multiple recognizers and is, at the same time, less computationally expensive.
Keywords
"Decoding","Natural languages","Statistics","Signal processing","Context modeling","Costs","Time measurement","Performance evaluation","Acoustic testing","Automatic testing"
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596137
Filename
596137