• DocumentCode
    3642469
  • Title

    Double bigram-decoding in phonotactic language identification

  • Author

    J. Navratil;W. Zuhlke

  • Author_Institution
    Dept. of Commun. & Meas., Tech. Univ. Ilmenau, Germany
  • Volume
    2
  • fYear
    1997
  • Firstpage
    1115
  • Abstract
    This paper describes a phonotactic language identification system that employs a multilingual phone-recognizer with multiple language-dependent grammars to tokenize the spoken signal into several phone-streams. For each stream, an independent set of language models computes the language scores, which are subsequently processed by two classification stages. Thus, the system acquires information from both the original-label and the decoded-phone statistics. A discriminative weighting method is applied in the second stage to better distinguish between similar languages. A modified language-bigram model, the so-called skip-gram, is introduced; it allows a wider phonotactic context to be exploited without increasing the estimation costs beyond those of a standard bigram. Measured on the NIST'95 evaluation set, the described system outperforms the state-of-the-art phonotactic components that use multiple recognizers and is, at the same time, less computationally expensive.
  • Keywords
    "Decoding","Natural languages","Statistics","Signal processing","Context modeling","Costs","Time measurement","Performance evaluation","Acoustic testing","Automatic testing"
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596137
  • Filename
    596137