DocumentCode
2943081
Title
Language identification using multiple knowledge sources
Author
Parris, Eluned S. ; Carey, Michael J.
Author_Institution
Ensigma Ltd., Chepstow, UK
Volume
5
fYear
1995
fDate
9-12 May 1995
Firstpage
3519
Abstract
Language identification experiments have been carried out on language pairs taken from seven of the languages in the OGI Multi-language Telephone Speech Corpus. This builds on previous work but introduces new techniques which are used to exploit the acoustic and phonetic differences between the languages. Subword hidden Markov models for the pair of languages are matched to unknown utterances resulting in three measures: the acoustic match, the phoneme frequencies and frequency histograms. Each of these measures gives 80 to 90% accuracy in discriminating language pairs. However these multiple knowledge sources are also combined to give improved results. Majority decision, logistic regression and a linear classifier were compared as data fusion techniques. The linear classifier performed the best giving an average accuracy of 89 to 93% on the pairs from the seven languages
Keywords
hidden Markov models; natural languages; pattern classification; sensor fusion; speech recognition; acoustic differences; acoustic match; data fusion; frequency histograms; language identification experiment; language pair; linear classifier; logistic regression; majority decision; multiple knowledge sources; phoneme frequencies; phonetic differences; subword hidden Markov models; utterances; Acoustic measurements; Databases; Frequency measurement; Hidden Markov models; Histograms; Logistics; Natural languages; Neural networks; Speech recognition; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479745
Filename
479745
Link To Document