Language identification using phoneme recognition and phonotactic language modeling

Author

Zissman, M.A.

Author_Institution

Lincoln Lab., MIT, Lexington, MA

Volume

5

fYear

1995

fDate

9-12 May 1995

Firstpage

3503

Abstract

A language identification technique using multiple single-language phoneme recognizers followed by n-gram language models yielded top performance at the March 1994 NIST language identification evaluation. Since the NIST evaluation, work has been aimed at further improving performance by using the acoustic likelihoods emitted from gender-dependent phoneme recognizers to weight the phonotactic likelihoods output from gender-dependent language models. We have investigated the effect of restricting processing to the most highly discriminating n-grams, and we have also added explicit duration modeling at the phonotactic level. On the OGI Multi-language Telephone Speech Corpus, accuracy on an 11-language, closed-set, language identification task has risen to 89% on 45-s utterances and 79% on 10-s utterances. Two-language classification accuracy is 98% and 95% for the 45-s and 10-s utterances, respectively. Finally, we have started to apply these same techniques to the problem of dialect identifications

Keywords

acoustic signal processing; grammars; natural languages; speech processing; speech recognition; telephony; NIST language identification; OGI Multi-language Telephone Speech Corpus; acoustic likelihoods; dialect identification; duration modeling; gender-dependent language models; gender-dependent phoneme recognizers; language identification; n-gram language models; phoneme recognition; phonotactic language modeling; phonotactic likelihoods output; single-language phoneme recognizers; two-language classification accuracy; Acoustic emission; Computational efficiency; Electronic mail; Humans; Laboratories; NIST; Natural languages; Real time systems; Speech recognition; Telephony;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479741

Filename

479741