Title :
Automatic language identification with recurrent neural networks
Author :
Braun, Jerome ; Levkowitz, Haim
Author_Institution :
Dept. of Comput. Sci., Massachusetts Univ., Lowell, MA, USA
Abstract :
Automatic language identification (LID), an important domain in speech processing, means the capability of a machine to determine a natural language from a spoken utterance. We present a novel approach to LID, which involves recurrent neural networks (RNN) as the main mechanism. We propose that, because of acoustical context issues, RNNs are particularly suitable for the LID task. Our approach also introduces perceptually guided training (PGT), a novel training method, that exploits the concept of perceptually significant regions (PSR) postulated by our approach. We present our overall approach and describe LIREN/PGT, the have we have developed, implementing our approach. We also discuss our LID experiments with English, German, and Mandarin. In the paper, we concentrate on the architecture and training of RNNs for automatic language identification
Keywords :
acoustic signal processing; learning (artificial intelligence); natural languages; recurrent neural nets; speech recognition; English; German; Mandarin; automatic language identification; natural language; perceptually guided training; perceptually significant regions; recurrent neural networks; spoken utterance; Computer science; Hidden Markov models; Humans; Multimedia systems; Natural languages; Recurrent neural networks; Routing; Speech processing; Speech recognition; Vocabulary;
Conference_Titel :
Neural Networks Proceedings, 1998. IEEE World Congress on Computational Intelligence. The 1998 IEEE International Joint Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
0-7803-4859-1
DOI :
10.1109/IJCNN.1998.687199