Title :
Statistical language identification based on untranscribed training
Author :
Lund, Michael A. ; Ma, Kristine ; Gish, Herbert
Author_Institution :
BBN Syst. & Technols., Cambridge, MA, USA
Abstract :
BBN´s baseline language identification (LID) system tokenizes utterances based on an English hidden Markov model (HMM) phone recognizer and uses language-dependent phone-bigram models to discriminate between languages. This is clearly a suboptimal procedure, as English phone models may fail to provide a meaningful tokenization of non-English speech. We address this problem through the use of parametric acoustic segment models derived from untranscribed target-language training. The paper describes some promising exploratory experiments related to model-segment selection and LID based on nearest neighbor classification, as well as an LID system in which language-specific HMMs are trained from unsupervised clustering of parametric acoustic segments. In addition, it describes an experiment in which phone HMMs trained on CallHome Mandarin are added to the baseline system, resulting in an error reduction of 30% on pairwise language discrimination on the OGI-TS corpus
Keywords :
acoustic signal processing; hidden Markov models; natural languages; speech recognition; statistical analysis; CallHome Mandarin; English hidden Markov model; English phone models; HMM phone recognizer; OGI-TS corpus; baseline language identification; baseline system; error reduction; exploratory experiments; language discrimination; language-dependent phone-bigram models; model segment selection; nearest neighbor classification; nonEnglish speech; pairwise language discrimination; parametric acoustic segment models; parametric acoustic segments; statistical language identification; suboptimal procedure; unsupervised clustering; untranscribed target-language training; untranscribed training; Acoustic measurements; Clustering algorithms; Databases; Error analysis; Hidden Markov models; Natural languages; Nearest neighbor searches; Parametric statistics; Speech recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.543240