Title :
Automatic language identification using Gaussian mixture and hidden Markov models
Author :
Zissman, Marc A.
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
Abstract :
Ergodic, continuous-observation, hidden Markov models (HMMs) were used to perform automatic language classification and detection of speech messages. State observation probability densities were modeled as tied Gaussian mixtures. The algorithm was evaluated on four multilanguage speech databases: a three language subset of the Spoken Language Library, a three language subset of a five-language Rome Laboratory database, the 20-language CCITT database, and the ten-language OGI (Oregon Graduate Institute) telephone speech database. In general, the performance of a single state HMM (i.e., a static Gaussian mixture classifier) was comparable with that of the multistate HMMs, indicating that the sequential modeling capabilities of HMMs were not exploited.<>
Keywords :
hidden Markov models; speech recognition; HMM; algorithm; automatic language classification; detection of speech messages; hidden Markov models; multilanguage speech databases; performance; sequential modeling capabilities; state observation probability densities; tied Gaussian mixtures;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319323