Language identification using multiple knowledge sources

Author

Parris, Eluned S. ; Carey, Michael J.

Author_Institution

Ensigma Ltd., Chepstow, UK

Volume

5

fYear

1995

fDate

9-12 May 1995

Firstpage

3519

Abstract

Language identification experiments have been carried out on language pairs taken from seven of the languages in the OGI Multi-language Telephone Speech Corpus. This builds on previous work but introduces new techniques which are used to exploit the acoustic and phonetic differences between the languages. Subword hidden Markov models for the pair of languages are matched to unknown utterances resulting in three measures: the acoustic match, the phoneme frequencies and frequency histograms. Each of these measures gives 80 to 90% accuracy in discriminating language pairs. However these multiple knowledge sources are also combined to give improved results. Majority decision, logistic regression and a linear classifier were compared as data fusion techniques. The linear classifier performed the best giving an average accuracy of 89 to 93% on the pairs from the seven languages

Keywords

hidden Markov models; natural languages; pattern classification; sensor fusion; speech recognition; acoustic differences; acoustic match; data fusion; frequency histograms; language identification experiment; language pair; linear classifier; logistic regression; majority decision; multiple knowledge sources; phoneme frequencies; phonetic differences; subword hidden Markov models; utterances; Acoustic measurements; Databases; Frequency measurement; Hidden Markov models; Histograms; Logistics; Natural languages; Neural networks; Speech recognition; Telephony;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479745

Filename

479745