• DocumentCode
    2942413
  • Title

    Language identification using phoneme recognition and phonotactic language modeling

  • Author

    Zissman, M.A.

  • Author_Institution
    Lincoln Lab., MIT, Lexington, MA
  • Volume
    5
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    3503
  • Abstract
    A language identification technique using multiple single-language phoneme recognizers followed by n-gram language models yielded top performance at the March 1994 NIST language identification evaluation. Since the NIST evaluation, work has been aimed at further improving performance by using the acoustic likelihoods emitted from gender-dependent phoneme recognizers to weight the phonotactic likelihoods output from gender-dependent language models. We have investigated the effect of restricting processing to the most highly discriminating n-grams, and we have also added explicit duration modeling at the phonotactic level. On the OGI Multi-language Telephone Speech Corpus, accuracy on an 11-language, closed-set, language identification task has risen to 89% on 45-s utterances and 79% on 10-s utterances. Two-language classification accuracy is 98% and 95% for the 45-s and 10-s utterances, respectively. Finally, we have started to apply these same techniques to the problem of dialect identifications
  • Keywords
    acoustic signal processing; grammars; natural languages; speech processing; speech recognition; telephony; NIST language identification; OGI Multi-language Telephone Speech Corpus; acoustic likelihoods; dialect identification; duration modeling; gender-dependent language models; gender-dependent phoneme recognizers; language identification; n-gram language models; phoneme recognition; phonotactic language modeling; phonotactic likelihoods output; single-language phoneme recognizers; two-language classification accuracy; Acoustic emission; Computational efficiency; Electronic mail; Humans; Laboratories; NIST; Natural languages; Real time systems; Speech recognition; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479741
  • Filename
    479741